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BACKGROUND OF THE INVENTION 
A variety of biochemical processes, particularly those involving protein-protein 

interactions, are believed to be mediated by an induced conformational change in the protein 

target. The resulting structural change in the protein is then used to explain a modification in 

its function (e.g., enzymatic activity) or its affinity for another protein in the biological 

system. Conformational change has been proposed to occur in the cascade of steps associated 

with certain signal transduction pathways in eukaryotic cells. A ubiquitous component of 

such signal transduction pathways is a heterotrimeric guanine nucleotide-binding protein (G- 

protein) coupied to a cell surface receptor (for reviews see references 1-4 and 72). G-proteins 

relay signals initiated by various stimuli including photons, odorants, and a number of 

hormones and neurotransmitters. A variety of diseases are caused by defects in G-protein 

activity. G-proteins exist as heterotrimeric complexes of a, p, and y subunits. The a-subunit 

(Ga) is weakly bound to a dimer (Gpy) in which the P-subunit is tightly bound to the y- 

subunit. Ga is also associated with the intracellular carboxy terminal tail of a seven-helical 

transmembrane receptor. G-proteins transfer signals from more than 1000 receptors with 

various Ga subtypes regulating a variety of distinct downstream signaling pathways. 

Guanine nucleotide binding and GTPase function within the Ga domain to regulate the 

activity of G-proteins. 

The G-protein signaling process is typically initiated by the binding of an agonist to 

the cell surface receptor resulting in an induced conformational change in the G-protein. The 

G-protein structural change affects the guanine nucleotide affinity of Ga, so that it 

preferentially binds GTP and Mg 2+ over GDP. Numerous x-ray structures for G ia! during the 
various stages of the GTPase cycle have been used to identify regions of induced 
conformational change (5-8). In particular, the Ga guanine nucleotide binding site is 
composed of three distinct "switch" regions: residues V179-V185 in switch I, residues Q204- 
H213 in switch II and residues A235-N237 in switch HI, which undergo conformational 
changes upon GTP hydrolysis. The Ga surface that binds the Gpy dimer contains switch I 
and switch II regions. In the active Ga-GTP-Mg 2+ complex, a conformational change in 
switch I is associated with binding Mg 2+ , and switch II and switch m regions become well 
ordered due to ionic interactions between the two switch regions. As a result of the formation 
of the Ga-GTP-Mg 2 complex, modifications in the structure of the three "switch'* regions 
facilitate dissociation of Ga from GPy. The released subunits are then available to interact 



with a variety of target proteins to elicit the desired response. Termination of the signal 
results when. the process is reversed by the hydrolysis of GTP bound to Ga. Reassociation of 
Ga with Gpy results in the inactivation of the G-protein. Therefore the duration of the G- 
protein signal is directly dependent on the GTPase activity of the Ga protein. 

Regulators of G-protein signaling (RGS) affect the intensity and duration of the G- 
protein signal cascade by binding to the active Ga-GTP-Mg 2 complex and inducing a 50-fold 
increase in the rate of GTP hydrolysis (For reviews see references 9-13). Conversely, RGS 
proteins have little or no affinity for the inactive Ga-GDP complex. Thus, RGS proteins act 
as attenuators of the induced G-protein signal by increasing the rate of inactivation of the G- 
protein and termination of the signal. RGS proteins may exhibit additional biological 
function, e.g., RGS4 is reported to block activation of GTP-Ga by effectors (83). The RGS 
family, including RGS4, GAIP (human Ga-interacting protein), RGS1, RGS 10, and RGS 16, 
among others, contains more than 20 known members where specificity for Ga subtypes has 
been demonstrated and appears to be associated with subtle sequence differences (8, 14). The 
RGS family contains significant structural diversity, however, all RGS proteins are 
characterized by a conserved domain of about 130 amino acids which may be separated by 
linker regions of varying lengths. Recently, the RGS family has been reported to comprise at 
least six separate subfamilies designated A-F with unique structural features (Zheng, B. et al. 
(1999) (86). RGS4 exhibits structural features of RGS subfamily A. Subfamily-specific 
structural features may be associated with subfamily-specific functions, e.g., differences in 
Ga binding specificity among RGS proteins, membrane association of RGS protein, or 
functions exhibited by RGS proteins in addition to GAP activity. RGS4 is believed to 
function to attenuate induced G-protein by stabilizing the transition. 

RGS proteins are widely expressed in eukaryotic cells, including human cells (13). 
At least one RGS protein is found in tissue of each human organ and many tissues express 
multiple RGS proteins. Additionally, members of the RGS family are specifically expressed 
in the human brain, where RGS4 is perhaps the most widely distributed and highly expressed 
RGS subtype (15, 16). RGS expression has been correlated with a response to induced 
seizures, which indicates that regulation of RGS expression is an adaptive response in the 
brain signal transduction pathway to compensate for desensitization and sensitization of G- 
protein-coupled receptor function (16). In addition to regulation of the response to 
neurotransm iters, RGS activity has been associated with a variety of cellular functions 



including: cell proliferation, cell differentiation, membrane trafficking and embryonic 
development (9, 10, 12, 17). 

An x-ray structure of RGS4 bound to Gial [8], site-directed mutagenesis [18-20] and 
biochemical studies [17, 21] suggest that RGS4 binds preferentially to the Ga-GTP-Mg 2+ 
complex and stabilizes the transition state structure of the switch regions facilitating 
hydrolysis of GTP. Since the functional result of RGS4 binding to Gial is induction of GTP 
hydrolysis by Gial, it is reasonable to anticipate that the conformational change upon 
complex formation with RGS4 primarily occurs in G ictl . However, the x-ray crystal structure 
of Gial in the RGS4-Gial complex exhibits only a 0.6 A rms difference from that of Gial 
in Gial-AIF 4 " which is trapped in the proposed transition state for GTP hydrolysis. This 
comparison indicates that there is no significant conformational change in Gial . On the 
other hand, analysis of the RGS4-Gial complex x-ray structure indicates that RGS4 binding 
to Gial induces a decrease in the mobility of the switch regions of Gial. In these regions, 
critical interactions occur between N82 of RGS4 (employing the numbering of Figure 1) with 
the switch regions I and II of Gial and between T182 of Gial with a Get binding pocket on 
RGS4. The RGS4 residue N82 has been identified as critical for facilitating the intrinsic Gial 
GTPase activity presumably by stabilizing the switch regions and substrate binding (19, 20). 
Similar changes in the switch regions are observed between the Ga-GTP-Mg 2 complex and 
the Ga-GDP complex (2), suggesting that a conformational change in RGS4 may contribute 
to regulation of G-protein signaling. 

de Alba, E. et al. (1999) (87) reports the solution structure of human GAIP as 
determined by NMR techniques. The structure calculation used dipolar couplings of the 
oriented protein in two different liquid crystal media. The GAIP solution structure was 
compared to that of the rat RGS4-G i ct 1 x-ray structure (8). The reference suggests that GAIP- 
L187 participates in Ga-RGS binding and may also be important in the folding and stability 
of the RGS protein. It is also suggested that GAIP-S1 56 plays a role in GAIP stability. 
GAIP-S156 has been identified as a subfamily-specific residue for the RGS subfamily A 
[GAIP, Ret-RGSl, RGS21] (86) and Wang et al. (89). In RGS subfamily B which includes 
RGS4, the core amino acid corresponding to GAIP SI 56 is RGS N82 (as numbered in Fig. 1 
and N128 as numbered in Tesmer et al. (8)). The core region of GAIP is reported to have 



only 60% sequence identity to the core of RGS4. Any differences observed between these 
two structures are at least in part due to the differences in amino acid sequence. 

It is thus desirable to provide structural information for free RGS4 to better 
understand the mechanism of the regulation of G-protein signaling. More specifically, such 
structural information allows a direct comparison between the solution structure of RGS4 and 
the x-ray structure of the RGS4-Ga complex to determine which conformational changes 
occur in RGS proteins on binding to Got. The structural information and comparison can be 
employed to identify factors (chemical or biochemical species) that affect G- protein 
signaling by interaction with RGS proteins or their complexes with Ga. The structurally 
information can be of particular use in the identification and rational design of agonists and 
antagonists of free RGS and RGS/Ga complex activity, 

' SUMMARY OF THE INVENTION 

The present invention provides the three-dimensional solution structure of a free (i.e., 
not complexed) RGS protein of subfamily B, specifically that of free RGS4, as determined by 
NMR (nuclear magnetic resonance) spectroscopy. Particularly, the invention provides the 
three-dimensional solution structure of a Ga binding site of an RGS subfamily B protein. 
The Ga binding site of RGS subfamily B is exemplified by the three-dimensional structure of 
the RGS4 Gial binding site comprising the RGS 4 protein residues Dl 17, SI 18 and R121 . 
The invention also provides the three-dimensional structure of the a6 - <x7 region of a free 
RGS subclass B which region exhibits a significant conformational change on binding of 
RGS to Ga. Binding at the a6 - a7 region of RGS protein can effect the function of RGS 
protein in G-protein signaling. Further, the invention identifies and provided the three- 
dimensional structure of an allosteric binding site in an RGS protein. Binding at this 
allosteric site can affect the regulation of G-protein signaling. An allosteric binding site in 
the RGS protein is exemplified by the allosteric binding site in RGS4 located in the eel and 
a2 helical regions of free RGS4 and in the tight rum located between the two helical regions. 
More specifically, the allosteric binding site in RGS4 comprises the residues V10, W13, LI 7, 
120, H23, E24, C25 and T132. 

The three dimensional structure of free RGS4 in solution, including the Ga binding 
site, the C-terminus a 6 - a 7 region of free RGS4, and the allosteric binding site in free RGS4 
are provided by the relative atomic structural coordinates given in Table 2 as obtained by 



NMR spectroscopy. Also provided are the ,5 N, l3 C, ,3 CO and l H NMR assignments for free 
RGS4 (Table 1) which are employed in the determination of its secondary and three- 
dimensional structure. These assignments are also useful in methods for identifying or 
detecting chemical and biochemical species that bind to RGS and which can affect RGS 
5 function or RGS-Ga function and are particularly useful for identifying or detecting species 
that bind to RGS subclass B which can affect its function or RGS subfamily B-Ga function. 

The invention further provides a representation or model of all or part of the three- 
dimensional structure of a free RGS subfamily B protein comprising a data set of relative 
atomic structural coordinates embodying the three-dimensional structure of free RGS4 
10 protein. The invention also provides a data set of relative atomic structural coordinates 

embodying the three-dimensional structure of the Ga binding site in an RGS subfamily B 
protein. The invention further provides a data set of relative atomic structural coordinates 
embodvin? the re, - re- region nf an RGS suhfarnilv nrntpin Tn arMitirm thpt invpntion 

^ provides a data set of relative atomic structural coordinates embodying an allosteric binding 
site in an RGS subfamily B protein. The data set and any structural representation or model 
p*j of a free RGS subfamily B, its Ga binding site its a 6 - a 7 region or the allosteric binding site 
Uj in RGS subfamily B created or generated using the data set provided herein can be employed 
Uj to identify, select or rationally design factors, e.g., chemical or biochemical species, which 

* affect RGS function or activity or RGS/Ga complex activity or function. Further, the data 

w 

W set, structural representation or model of the Ga binding site can also be used to identify, 

pj 

^. select or rationally design species which affect Ga function by binding to Ga. The data set 
P and structural representations and models provided by this invention are particularly useful 

for the identification of agonists or antagonists of RGS function or RGS/Ga complex 

function or activity. 

25 The data set, including subsets of data embodying the Ga binding site, the a6 - a7 

region and the allosteric binding site, provided herein was determined by NMR analysis. 
However, any known method can be employed to provide the structural data. In one 
embodiment, the data set embodies the structure of free RGS4 in solution. In certain _ 
embodiments, the data set comprises one or more portions, of the structure of free RGS4. Of 

30 particular interest are those portions of the structure of RSG4 which function in RSG- 

regulation of G-protein signaling or more specifically which affect binding of RGS to Ga or 
which affect the biological function or activity of the RGS-Ga complex. Also of interest are 



6 



those portions of the structure of RSG4 to which candidate agonists and antagonists of RSG 
bindto affect its biological function- 
Any available method may be used to construct a structural representation or model 
from the NMR-derived data disclosed herein or from data obtained from an independent 
5 structural analysis of free RGS4. Such a model or representation can be generated or 

constructed from the available analytical data points using software packages such as HKL, 
CHARMM, MOSFILM, XDS, CCP4, SHARP, PHASES, HEAVY, XPLOR, TNT, 
NMRCOMPASS, NMRPIPE, DIANA, NMRDRAW, FELIX, VNMR, MADIGRAS, 
QUANTA, BUSTER, SOLVE, O, FRODO, XPLOR, RASMOL, and CHAIN, all of which 
10 are well-known and available to those in the art. A structural representation or model can be 
generated from these data using available systems, including, for example, Silicon Graphics, 
Evans and Sutherland, SUN, Hewlett Packard, Apple Macintosh, DEC, IBM, and Compaq 
systems. The structural representation or model can be displayed or generated in any two- 
El dimensional or three-dimensional form known in the art for viewing, analyzing, modeling or 
Jf5 otherwise representing the structure. The structural representation can be transmitted, 

SI 

j conveyed or stored in any known graphic, digital or analog form. Structural representations 

l y 

O or models generated with the RGS data provided herein can be combined with structural 

ip. 

y| representations of other chemical and biochemical species (e.g., candidate antagonists or 

L agonists) including x-ray data of RGS-complexes, in order to analyze potential interactions 

5,0 between RGS, particularly RGS subfamily B proteins, and Got and those species. The data 

"i; provided herein may also be combined, as illustrated herein, with structural information 

Q (including x-ray data) of RGS-complexes, particularly RGS subclass B protein-complexes 

M= • 

and particularly those complexes believed to be or believed to model biologically functional 

complexes. 

25 The present invention relates to the structural data for free RGS4, the Ga binding site 

of RGS4, the a6 - a 7 region whose conformation changes on binding of RGS4 to Ga, and 
allosteric binding sites in RGS4 in any form (for example in digital, tabular, graphic, or 
pictorial form or as embodied in any representation or model or as embodied in a computer 
storage medium) and the use of the data (in whatever form) for generating a structural 

30 representation or model of free RGS, particularly an RGS subfamily B protein, more 

particularly RGS4, or of the interaction of RGS, an RGS subfamily B protein, and RGS4, 
with any other chemical or biochemical species, including structural representations or 
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models of RGS interaction with G-protein subunits and of RGS interaction with potential 
agonists or antagonists of RGS function. 

The present invention also provides for a computer system which comprises the 
structural representation or model of the invention and hardware used for construction, 
5 processing and/or visualization of the model of the invention. The solution structural 
coordinates of RGS4 or portions thereof as provided herein can be stored in or on an 
appropriate medium for introduction into or use with any computer program or system for 
generating a representation or model of the structure of an RGS protein, an RGS subclass B 
protein or RGS4, or for analysis of the interaction of RGS with other chemical or biochemcial 
10 species. 

The structural coordinates can be stored in a machine-readable form on a machine- 
readable storage medium, for example, a computer hard drive, diskette, DAT tape, etc., for 
display as a three-dimensional shape or for other uses involving computer-assisted 
g manipulation of, or computation based on, the structural coordinates or the three-dimensional 
J5 structures they define. By way of example, the data defining the three dimensional structure 

=P of RGS4 of the present invention, or of a portion of RGS4 as disclosed herein, may be stored 
fU 

Q in a machine-readable storage medium, and may be displayed as a graphical three- 
dimensional representation of the relevant structural coordinates, typically using a computer 
capable of reading the data from said storage medium and programmed with instructions for 
|Sp creating the representation from such data. 
fl ! Accordingly, the present invention provides a machine, such as a computer, 

p programmed in memory with the coordinates of the RGS4 or RGS subfamily B protein, or 
*" portions thereof (such as, by way of example, the coordinates of the RSG4 Got binding site, 

the cc6 - a 7 region of RGS4, or the allosteric binding site in the al-<x2 region of RGS4), 
25 together with a program capable of converting the coordinates into a three dimensional 

graphical representation of the structural coordinates on a display connected to the machine. 
A machine having a memory containing such data aids in the rational design or selection of 
inhibitors or activators of RGS, Ga or RGS-Ga complex activity, including the evaluation of 
the ability of a particular chemical or biochemical species to favorably associate with RGS, 
30 particularly an RGS subclass B protein, as well as in the modeling of compounds, proteins, 

complexes, etc. related by significant structural or sequence homology to RGS4 or other RGS 
proteins. 
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The present invention is additionally directed to a method of determining the three 
dimensionaLsoIution structure of a compound, e.g., a protein or peptide or other chemical or 
biochemical species (including RGS proteins or portions thereof, or more specifically, RGS 
subfamily B proteins or portions thereof that are not RGS4) whose structure is unknown, 
comprising the steps of first obtaining a solution of the protein or peptide whose structure is 
unknown, and then generating NMR data from this solution. The NMR data from the protein 
or peptide can then be compared with the known three dimensional structure of RGS4 (or 
portion thereof, e.g., the Ga binding site) as disclosed herein, and the three dimensional 
structure of the protein or peptide whose structure is unknown conformed to the known RGS4 
structure using standard techniques, such as 2D, 3D and 4D isotope filtering, editing and 
triple resonance NMR techniques, computer homology modeling as well an adaptation of 
molecular replacement techniques as applied to NMR data. Alternatively, a three 
dimensional model of a protein or peptide of unknown structure, but related by sequence 
similarity to RGS4, may be generated by initial sequence alignment between RGS4 and the 
protein or peptide, based on any or all amino acid sequence identity, secondary structure 
elements or tertiary folds, and then generating by computer modeling a three dimensional 
structure for the molecule using the known three dimensional structure of, and sequence 
alignment with, RGS4. 

Methods are also provided for identifying a species which is an agonist or antagonist 
of RGS activity, RGS binding to Ga, Ga binding to RGS, or RGS/Ga complex activity, 
particularly for RGS subfamily B proteins. The method comprises the steps of using a three 
dimensional structure of free RGS subfamily B protein or a portion (e.g., an RGS4 core 
protein) thereof as defined by the relative structural coordinates of amino acids encoding the 
RGS4-core protein to design or select a potential agonist or antagonist, and synthesizing or 
otherwise obtaining the potential agonist or antagonist. The potential agonist or antagonist 
may be selected by screening an appropriate database, may be designed de novo by analyzing 
the steric configurations and charge potentials of the RGS4 Ga binding site, the a 6 - a 7 region 
of RGS4, or an allosteric binding site of RGS4 in conjunction with the appropriate sojtware 
programs, or may be designed using characteristics of known agonists or antagonists of 
RGS4, RGS subfamily B, or other RGS proteins in order to create "hybrid" agonists or 
antagonists. The method of the present invention is preferably used to design or select 
inhibitors of RGS subfamily B proteins, or RGS subclass B-Ga complex activity, and 



specifically RGS4 or RGS4-Gicc 1 complex activity. In a specific embodiment, the potential 
agonist or antagonist is identified, selected or designed by studying the interaction of 
candidate species with a three-dimensional model of RGS4 (or a portion thereof) or a three- 
dimensional model of another RGS subfamily B protein (or model thereof) and selecting a 
species which is predicted by its interaction with the RGS protein or a portion of an RGS 
protein to act as an agonist or antagonist. Potential antagonists and agonists can be readily 
tested using various procedures disclosed herein or known in the art to confirm their 
antagonist. or agonist function. Species identified in accordance with such methods are also 
provided. 

Other specific embodiments provide: (1) a process of identifying a substance that 
inhibits RGS4 activity, RGS4 binding to Gial, Gial binding to RGS 4 or RGS4/Gial 
complex activity comprising determining the interaction between a candidate substance and a 
model of all of part of the structure of free RGS4, or (2) a process of identifying a substance 
that mimics or promotes RGS4 activity, RGS4 binding to Gial, Gial binding to RGS4 or 
RGS4/Ga complex activity comprising determining the interaction between a candidate 
substance and a model of all or part of the structure of free RGS4 by analyzing the steric 
configuration and charge potential of free RGS4 and comparing these properties to those of a 
candidate substance. Substances identified in accordance with such processes are also 
provided. 

Other embodiments provide a method of identifying antagonists or agonists of RGS 
activity, RGS binding to Ga, Ga binding to RGS or RGS/Ga complex activity by rational drug 
design comprising: (a) designing a potential antagonist or agonist that will. form a reversible or 
non-reversible complex with one or more amino acids in the RGS Ga binding site based upon 
the structure coordinates of free RGS4; (b) synthesizing or otherwise obtaining the antagonist 
or agonist; and (c) determining whether the potential antagonist or agonist inhibits or promotes 
the activity or binding of RGS or the activity of the RGS-Ga complex. In other preferred 
embodiments, the antagonist or agonist is designed to interact with one or more atoms of one or 
more amino acids in the RGS4-Gialbinding site. More specifically, the antagonist or agonist 
is designed to interact with amino acids selected from the group consisting of Dl 17, SI 18, or 
R121 of RGS4, other amino acids associated with the Ga binding site and other amino acids 
revealed by the determined structure. Yet more specifically, the antagonist or agonist is 
designed to interact with amino acids selected from the group consisting of S39, E41, N42, 
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L113, DU7, SI 18, R121 or N82 of RGS4. Substances identified in accordance with such 
processes are also provided. The agonist or antagonist may form a covalent or non-covalent 
bond with an RGS protein. This method is specifically applicable to identifying antagonists or 
agonists of RGS subfamily B proteins. 
5 Other specific embodiments provide a method of identifying antagonists or agonists of 

RGS activity or RGS/Ga complex activity by rational drug design comprising: (a) designing a 
potential antagonist or agonist that will form a reversible or non-reversible complex with one or 
more amino acids in a ct6 - a 7 region of RGS based upon the structure co-ordinates of free 
RGS4; (b) synthesizing the antagonist or agonist; and (c) determining whether the potential 
10 antagonist or agonist inhibits or promotes the activity of RGS or RGS/Ga complex. In preferred 
embodiments, the antagonist or agonist is designed to prevent or facilitate conformation change 
in these regions on binding to Ga. This method is specifically applicable to identifying 

— tr ~- ^ T T> r~l<C £ .,,kf'nT-m"l'w "R r*rr\tf*in orti-wiHv r\r T? Pr's cnhfbmilv R/flrY mmnlpY 

tUlta^UlUDU Ul a^UlliJlO V/l AV^JU jui/iwhh; j_ f - — — ~ ^ ■ * j — j 

^ activity. 

^ 5 Other specific embodiments provide a method of identifying antagonists or agonists of 

j^j RGS activity or RGS/Ga complex activity by rational drug design comprising: (a) designing a 

S potential antagonist or agonist that will form a reversible or non-reversible complex with one or 

Lfi 

y*l more amino acids in an RGS4 allosteric binding site based upon the structure co-ordinates of free 

!L RGS4; (b) synthesizing the antagonist or agonist; and (c) determining whether the potential 

y20 antagonist or agonist inhibits or promotes the activity of RGS or RGS/Ga complex. In preferred 

J| embodiments, the antagonist or agonist is designed to interact with the allosteric binding site in 

O the a r a 2 region of RGS4. In yet other preferred embodiments^the antagonist or agonist is 

1=3: 

designed to interact with one or more atoms of one or more amino acids in the allosteric binding 
site in the a t and a-, region of RGS4, and particularly with one or more atoms of amino acids 
25 V10, W13, L17, L20, H23, E24, C25, or T132 of RGS4. Substances identified in accordance 
with such processes are also provided. This method is specifically applicable to identifying 
antagonists or agonists of RGS subclass B protein activity or RGS subclass B/Ga complex 
activity. 

Candidate agonists and antagonists of RGS, RGS-Ga complexes can be selected from 
30 any type of small molecule, dimer, multimer, oligomer, or polymer of natural or non-natural 
origin that is obtained from any source and may be isolated from a natural source or chemically 
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or biologically synthesized. Candidate antagonists and agonists can include nucleic acids, 
peptides, polypeptides, proteins, and various small organic molecules. 

The study of the interaction of the candidate species with the three-dimensional 
structure of RGS and/or portions of that structure can be performed using available software 
5 platforms, including QUANTA, RASMOL, O, CHAIN, FRODO, INSIGHT, DOCK, 
MCSS/HOOK, CHARMM, LEAPFROG, CAVEAT(UC Berkley), CAVEAT(MSI), 
MODELLER, CATALYST, and ISIS. 

The invention also provides a method for identifying the presence of and determining 
the location of allosteric binding sites in RGS4. The method comprises the steps of 
10 contacting free RGS4-core in solution with test compounds that are members of a library of 
chemical species which encompass a range of structural features or which are known to 
inhibit RGS function; measuring the 'H, 15 N, and/or l3 C NMR spectra of the RGS4-core in 
the presence of test compounds of the library to detect any perturbations in the chemical 
J ; shifts of RGS4-core that are induced by binding of a test compound to RGS4-core, and 

"45 determining if binding of the test compound affects RGS activity. This can be done, for 
ry example, by assessing the affect of the test compound on RGS induced Got GTPase activity. 

y| The amino acid residues of RGS4-core that are affected by binding of the test compound 

U1 define the binding site of the test compound. If the test compound is found to affect RGS4- 

£=, core activity and the location to which the test compound binds in RGS4-core is not the Get 
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§0 binding site, then the location to which the test compound binds is an allosteric binding site. 

p| 

kf; One such allosteric binding site in the <x\-a2 region of RGS4-core has been identified using 

~' this method. 

H : 

The three-dimensional structure of any allosteric binding site identified by this 
method can then be employed in methods described herein to identify, select and design 

25 candidate agonists and antagonists of RGS activity and specifically of RGS subclass B 
activity. Test compounds for assessing the presence of allosteric sites in RGS can be 
members of a library that exhibit a range of structural feature (e.g., alicyclic rings, 
heterocyclic rings, aromatic rings, aliphatic, alicyclic compounds or aromatic compounds 
displaying various substituent groups (e.g., OH, -CO-,- NHCO-, etc.). Test compounds can 

30 also be selected in screens for compounds that are known to exhibit an affect on RGS activity 
(e.g., that enhance or retard the rate of RGS4-induced Ga GTPase). Initial screens can be 
performed by assessing mixtures containing a plurality of test compounds for an affect on 
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RGS activity. In cases in which an affect is observed with the mixture of test compounds, the 
individual compounds can be re-tested individually to determine which test compound(s) 
affect RGS activity. 

In a specific embodiment, the invention provides a method in which the three 
dimensional structure of free RGS4-core is employed to identify chemical or biochemical 
species or fragments thereof capable of binding to an RGS protein. Once identified the 
species or fragments capable of binding to RGS are assembled (using well-known computer 
modeling techniques) into a single molecule to provide a structure of a potential antagonist or 
agonist. The molecule assembled can contain additional species or fragments (e.g., a 
backbone) for desired orientation of the species or fragments capable of binding to RGS. 
This method is particularly applicable to RGS subfamily B proteins. 

The invention further provides a method for identifying mutants of RGS4 proteins in 
which the activity of the mutant protein is different from that of RGS4. In this method the 
three-dimensional structure of free RGS4 is employed to identify amino acids that are 
involved in the regulation of G-protein signaling. One or more of the amino acid residues 
identified are then modified to generate a mutant RGS4. Mutants identified in this method 
are expected to exhibit altered function in the regulation of G-protein signaling. 

Other objects of the invention will be readily apparent from the following detailed 
description. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 illustrates the secondary structure of RGS4. The figure provides a summary 
of the sequential and medium range NOEs involving the NH, Ha and Hp protons, the amide 
exchange and 3 JHNa coupling constant data, and the 13 Cct and l3 Cp secondary chemical 
shifts observed for RGS4 with the secondary structure deduced from this data. The thickness 
of the lines reflects the strength of the NOEs. Amide protons still present after exchange to 
D2O are indicated by closed circles. The open boxes represent potential sequential 
assignments NOEs which are obscured by resonance overlap and could therefore no^be 
assigned unambiguously. The gray boxes on the same line as the Hct(i)-NH(i+l) NOEs 
represents the sequential NOE between the Ha proton of residue i and the C5H proton of the 
i+1 proline and is indicative of a trans proline. Seven alpha helical regions are indicated (al- 
ec?). 
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Figures 2A and 2B are ribbon diagrams of the (A) x-ray structure of RGS4 from the 
RGS4-Gial complex, (B) NMR structure of free RGS4 for residues V5 to PI 34. The residues 
which exhibit a significant structural change between the RGS4-Gial x-ray structure and the 
free'RGS4 NMR structure are numbered. Residues Kl 16-Y1 19 correspond to key residues 
involved in the interaction with Gia 1 and the location of a structural change between the free 
and complexed forms of RGS are indicated. The C- and N-terminal regions which exhibit a 
change in secondary structure and helical packing are also indicated. The RGS4-Giccl x-ray 
structure is that of Tesmer et al. (8). The C- and N-terminal regions which incur a change in 
secondary structure and helical packing are indicated. The observed helical regions of the 
RGS4 structure are labeled. 

DETAILED DESCRIPTION OF THE INVENTION 
F.GS proteins are regulators of G-protein signaling which affect the intensity and 
duration of the G-protein signal cascade by binding to the active Ga-GTP-Mg 2 complex to 
increase therate of GTP hydrolysis. RGS proteins act as attenuators of the induced G-protein 
signal by increasing the rate of inactivation of the G-protein and termination of the signal. 
RGS proteins have been identified in a wide range of eukaryotes, including humans. RGS 
proteins are highly diverse, multifunctional proteins characterized by the presence of a core 
region of approximately 130 amino acid residues (sometimes identified as having 120 amino 
acids), which may be separated by linker regions of varying lengths (79, 80, 9), that is 
conserved in all RGS proteins that have so far been identified. All RGS proteins that have 
been identified bind to members of the Gia class of G protein a surpunits. The family of RGS 
proteins include RGS4, GAIP (human Gcc-interacting protein), RGS 1, RGS 10, RGS1 1, 
RGS 12, RGS 13, RGS 14, and RGS 16 (also called RGSr), Axin, Conductin, pi 15-RhoGEF, 
PD2-RhoGEF and LSC (36), among others, and contains more than 20 known members 
where specificity for Get subtypes has been demonstrated and appears to be associated with 
subtle sequence differences (8, 14). RGS4 is believed to stabilize the transition state for GTP 
hydrolysis (17, 57, 21). The conserved region of RGS provides for binding to Gcc and can 
thus be used to identify species that affect (as agonists or antagonists) RGS binding to Ga and 
the activity of RGS as an attenuator of G-protein signaling. RGS proteins of this invention 
function in G-protein regulation by binding to the Ga subunit of a G-protein. RGS proteins 
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may, but need not, exhibit other biological functions. References 89 and 91 provided reviews 
of additional-biological functions exhibited by RGS proteins. 

The term "RGS protein" as used herein, including its use for specific RGS proteins 
and RGS protein subfamilies, includes native RGS proteins (and native RGS core proteins) 
5 isolated from or otherwise obtained from (e.g.,_by expression of cloned genes) from any 

natural sources, recombinant RGS proteins which may contain portions of RGS sequence and 
non-RGS sequence (e.g., RGS-core sequence with the hexahis pro-tag), variant RGS proteins 
which contain conservative amino acid sequence differences from a native RGS protein or in 
which sequences non- functional in RGS activity are deleted, mutant RGS proteins in which 
10 one or more amino acids have been modified by expression from a mutant RGS coding 

sequence. Mutants include, among others, those having one or more site specific mutations, 
those having one or more deletions and those having one or more insertions compared to a 

rmtivp T? f^s nrntpin (r\r t? n^.r-nrp^ c\r unnant T? C^^Z { r\r uorionf Dr.Q.rnro^ TV,o term nnnMnt 
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RGS refers in particular to those proteins having the described mutations, insertions or 
^j|5 deletions in the RGS core region. Variant RGS proteins are expected to have biological 
fl! function for G-protein regulation substantially the same as that of the native RGS protein 

o 

ipi from which they are derived. Mutant RGS proteins include those which have biological 

^ i function substantially the same as or modified from that of a native or variant RGS protein 

e 

Q from which they are derived. Variant, derivative, recombinant and mutant RGS proteins do 

pgO not necessarily represent mutually exclusive subsets of proteins. 

UD As noted herein, the RGS-core region is involved with RGS function in G-protein 

^ regulation, the term RGS proteins as used herein include RGS proteins in which non- 

functionaj regions are absent, e.g., RGS-core regions of native, recombinant, variant or 
mutant RGS proteins. The RGS core region of a native RGS has been found to retain full 
25 native RGS activity (8). The core region of RGS4 is approximately 130 amino acids in 

length. (References may also refer to conserved or core regions of RGS as having a length of 
approximately 120 amino acid) RGS cores from other RGS proteins can differ in length from 
that of RGS4. RGS proteins of this invention can be obtained by in vitro or in vivo 
expression of an RGS coding sequence by isolation from natural sources or any other means 
30 known in the art. 

Known RGS proteins are categorized into six or seven subfamilies on the basis of a 
phylo-genetic analysis of 61 mammalian and invertebrate RGS proteins (86). Mammalian 
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RGS proteins are composed of at least six subfamilies designated A-F as follows: A (GAIP, 
Ret-RGSl, RGS21); B (RGS I , RGS2, RGS3, RGS4, RGS5, RGS8, RGS 1 3 and RGS 16 [also 
called RGS-r]; C (RGS6, RGS7, RGS 9 and RGS1 1); D (RGS 12, RGS 14); E (Axin and 
Conducting and F (pi 1 5-RhoGEF, PD2-RhoGEF and Lsc). Two other RGS proteins TGS 10 
and D-AKAP2 are structurally diverse from those of subfamilies A-F and may represent a 
separate subfamily. Subfamilies B, C and D all have characteristic residue Asn (N82 in 
RGS4 as numbered herein, or N 128 as numbered in Tesmer et al. [8]), which is associated 
with Gcc binding at least in RGS subfamily B proteins. RGS proteins of subfamily A are 
substituted at this position in the RGS core with a serine (SI 56 in GAIP). Additionally, the B 
subfamily of RGS proteins is reported to have another characteristic residue, a serine (at 
position 57 in RGS4 as numbered herein and SI 03 as numbered in Tesmer et al. [8]). RGS4 
represents the B subfamily of RGS proteins and is structurally more similar to and believed to 
have biological activity and function more similar to other members of the B subfamily, 
including RGS1, RGS 2, RGS3, RGS5, RGS8, RGS 13 and RGS 16. GAIP, for example, is 
representati ve of the A subfamily of RGS proteins and is structurally more similar to and 
believed to have biological activity and function more similar to other members of the A 
subfamily including Ret-RGSl and RGSZ1. Thus, the term RGS subfamily B refers to 
RGS1, RGS 2, RGS3, RGS4, RGS 5, RGS8, RGS 13, RGS 16 and other, as yet 
uncharacterized, RGS proteins that exhibit structural features characteristic- of the B 
subfamily and which are classifiable into the B subfamily by phylogenetic analysis as 
described in Zheng, B. et al. (1999) supra . Analogously, the term RGS subfamily A refers to 
GAIP, Ret-RGSl and RGS2 1 and other RGS proteins as yet uncharacterized that exhibit 
structural characteristics of the A subfamily and which are classified as RGS subfamily A 
proteins by phylogenetic analysis. RGS subfamilies C-F have analogous definitions. 

The term RGS4 refers to RGS4 exemplified by RGS4 of rat (Tesmer et al (1997) 
supra ) and homologs thereof including, among others, human RGS4 and mouse RGS4. The 
RGS core region of human, rat and mouse RGS4 differ from one another by 2-4 amino acids 
(representing about 97% or more sequence identify in the 130 amino acid core). Hornologs 
of GAIP can exhibit as low as about 85% sequence identity in the RGS core region. An 
RGS4 homolog may, thus, exhibit RGS4 core sequence identify as low as about 85% with rat 
RGS4. 



16 



RGS protein NMR studies and structural determinations herein were performed using 
an RGS4 -core protein consisting of the conserved region of RGS4 (specifically that derived 
from rat) with a N-terminal methionine and a C-terminal hexahistidine tail. The three- 
dimensional solution structure determined for the RGS4-core protein, assuming the 
possibility of conservative amino acids changes and within ± a root mean square deviation of 
the relative structural coordinates of the backbone atoms listed in Table 2 of not more than 
1.5 A (or more preferably, not more than 1.0A, or most preferably, not more than 0.5 A), 
model the three-dimensional solution structures of other RGS4 proteins of any eukaryotic 
origin, including human RGS4. Further, because of the significant conservation of this 
domain among different RGS proteins, the three-dimensional structure of RGS4-core 
provided herein, again assuming conservative amino acids changes, and within ± a root mean 
square deviation of the relative structural coordinates of the backbone atoms of the structure 

01 noi more uiaii i.j/a \kjl iuvig piciciauij^, u\j\. ^ uitui i . «i mv^i jjiv 4 w«u. ; , **w* — - 
than 0.5 A), models the structures of the conserved region in other RGS proteins of all origins. 

"Structural coordinates'* are the Cartesian coordinates corresponding to an atom's 
spatial relationship to other atoms in a molecule or molecular complex. Structural 
coordinates may be obtained using NMR techniques, as described herein or as known in the 
art, or using x-ray crystallography as is known in the art. Alternatively, structural coordinates 
can be derived using molecular replacement analysis or homology modeling. Various 
software programs allow for the graphical representation of a set of structural coordinates to 
obtain a three dimensional representation of a molecule or molecular complex. The structural 
coordinates of the present invention may be modified from the original sets provided in Table 

2 by mathematical manipulation, such as by inversion or integer additions or subtractions. As 
such, it is recognized that the structural coordinates of the present invention are relative, and 
are in no way specifically limited by the actual x, y, z coordinates of Table 2. The structural - 
coordinates of Table 2 ± a root mean square deviation from the conserved backbone atoms of 
the amino acids therein(or conservative substitutions thereof) of not more than 1.5 A (or more 
preferably, not more than 1.0A, or most preferably, not more than 0.5 A) define or embody the 
three-dimensional structure of free RGS4 (i.e., not complexed with another molecule) in 
solution. The RGS4 core conserved region contains a Get binding site and an allosteric 
binding site. Amino acid sequences can be inserted between the helical regions of the RGS 
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core region without significantly altering the biological function of the RGS protein. RGS 
proteins of lower eukaryotes contain such insertions. 

"Root mean square deviation" is the square root of the arithmetic mean of the squares 
of the deviations from the mean, and is a way of expressing deviation or variation from the 
structural coordinates described herein. 

As used herein, "RGS activity," "activity of RGS" and other similar terms refer to the 
ability of RGS to bind to an active Gct-GTP-Mg 2 complex and induce a change in the rate of 
GTP hydrolysis. Any other biological function or activity of an individual RGS protein will 
be specifically defined herein. References 89 and 91 are incorporated by reference herein for 
their review of the additional biological functions of certain RGS proteins. Any assay which 
measures the rate of GTP hydrolysis in a Gct-GTP-Mg 2 complex in the presence and absence 
of RGS (or portions thereof) can be used to measure such activity. A preferred assay method 
.^;^;t,t^ r-o^^iok^i^rl nhncnh^tp that r^iilK from hvdrnlvsis of Ga-rv- 32 Pl- 
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GTP-Mg 2 as described in the Examples herein. 

Table 2 lists the atomic structure coordinates for the restrained minimized mean 
structure of free RSG4 as derived by NMR spectroscopy. The first two columns in Table 2 
list atom number, the third column identifies the atom type using standard nomenclature, the 
fourth and fifth columns list the amino acid and its number in the sequence. The sixth, 
seventh and eighth columns of the table are relative coordinate values (in three dimensions). 

It will be obvious to the skilled practitioner that the numbering of the amino acid 
residues in the RGS4 and other RSG proteins covered by the present invention may be 
different than that set forth herein. The RGS4 core protein used herein contains an RGS core 
domain with an N-terminal Met and a six residue histidine tag at the C-terminus. In Fig. 1 
the amino acid sequence of the RGS4 core protein used is numbered beginning at the N- 
terminal Met. For comparison to the full-length RGS4 sequence (for example, as numbered 
in Tesmer et al. (1997) (8)) add 46 to the numbering used herein. 

It will also be obvious to the skilled practitioner that RGS proteins and portions 
thereof covered by this invention may contain certain conservative amino acid substitutions 
that yield the same three dimensional structures as those defined by the structural coordinates 
provided herein ± a root mean square deviation from the conserved backbone atoms of the 
amino acids therein(or conservative substitutions thereof) of not more than 1.5 A. Amino 
acids in other RGS proteins or peptides corresponding to those in RGS4 and conservative 



18 



substitutions in other RGS proteins or peptides are readily identified by visual inspection of 
the*elevant-amino acid sequences or by using commercially available homology software 
programs. "Conservative substitutions" are those amino acid substitutions which are 
functionally equivalent to the substituted amino acid residue, either by way of having similar 
polarity, steric arrangement, or by belonging to the same class as the substituted residue (e.g., 
hydrophobic, acidic or basic), and includes substitutions having an inconsequential effect on 
the three dimensional structure of RGS with respect to the use of said structure for the 
identification and design of RGS antagonists or agonists, and for molecular replacement 
analyses and/or for homology modeling. 

The structural coordinates of the present invention permit the use of various molecular 
design and analysis techniques in order to (i) solve the three dimensional structures of related 
RGS proteins, peptides or complexes thereof, and particularly RGS subfamily B proteins, 
peptides or complexes thereof and (ii) to select, design and synthesize or otherwise obtain 
chemical and biochemical species capable of associating, binding or interacting with RGS 
potentially having function as antagonists or agonists of an RGS, Got or an RGS-Ga complex. 

Molecular replacement analysis is a well-known technique employed in x-ray 
crystallography which uses the x-ray structure of a molecule having as a starting point to 
model a molecule whose crystal structure is unknown. This technique is based on the 
principle that two molecules which have similar structures, orientations and positions will 
diffract x-rays similarly. A corresponding approach to molecular replacement is applicable to 
modeling an unknown solution structure using NMR technology. The NMR spectra and 
resulting analysis of the NMR data for two similar structures will be essentially identical for 
regions of the molecules that are structurally conserved, where the NMR analysis consists of 
obtaining the NMR resonance assignments and the structural constraint assignments, which 
may contain hydrogen bond, distance, dihedral angle, coupling constant, chemical shift and 
dipolar coupling constant constraints. Appropriate NMR spectra are accumulated for a 
solution of the species of unknown structure and compared to NMR of the species of known 
structure. The observed differences in the NMR spectra of the two structures will highlight 
the differences (and similarities) between the two structures and identify the corresponding 
differences in the structural constraints. The structure determination process for the unknown 
structure is then based on modifying the NMR constraints from the known structure to be 
consistent with the observed spectral differences. This method is applicable to the 
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determination of three-dimensional solution structures of any RGS protein or peptide using 
the structural information for RGS4 provided herein. The method is most appropriate for 
determining the structures of RGS proteins that are expected to have significant structural 
similarity with RGS4. For example, this invention specifically provides the three- 
dimensional structure of a rat RGS4-core region in solution. The replacement method 
described above can be employed to determine the three-dimensional structure of the human 
RGS4-core which differs from that of rat by 2 amino acids in the 130 RGS core (at positions 
22 N (rat) >S (human), and 132 T(rat) > V(human), referring to the rat sequence given in Fig. 
1. 

Accordingly, in one nonlimiting embodiment of the invention, the NMR resonance 
assignments for RGS4 provide the starting point for resonance assignments of other RGS 
family proteins (or portions thereof), that are expected to be structurally similar to RGS4, 
e.g.. RGS 4 homoloes from different organisms or more generally RGS proteins of the 
subfamily B. Chemical shift perturbations can be detected using one or two dimensional 
spectra (e.g.-, I5 N/ l H, 13 C/*H spectra) or using other methods well known in the art and 
compared between RGS4 and another RGS protein. In this way, the affected residues may be 
correlated with the three dimensional structure of RGS4 as provided by the relevant residues 
of Table 2. This effectively identifies the region of the other RGS protein or peptide that has 
a structural change relative to the RGS4 protein. The J H, I5 N, 13 C and I3 CO NMR resonance 
assignments corresponding to both the sequential backbone and side-chain amino acid 
assignments of the other RGS protein, or portion thereof, can then be obtained and the three 
dimensional structure of this protein, or portion thereof, can be generated using standard 2D, 
3D and 4D triple resonance NMR techniques and NMR assignment methodology, using the 
RGS4 structure, resonance assignments and structural constraints as a reference. Various 
computer fitting analyses of the other RGS protein or peptide with the three dimensional 
model of RGS4 can be performed in order to generate an initial three dimensional model of 
the other RGS protein or peptide, and the resulting three dimensional model may be refined 
using standard experimental constraints and energy minimization techniques in order to 
position and orient the other RGS in association with the three dimensional structure of 
RGS4. * 

The present invention further provides that the structural coordinates of the present 
invention can be used with standard homology modeling techniques in order to determine the 
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unknown three-dimensional structure of an RGS protein or portion thereof. Homology 
modeling, as is known in the art, involves constructing a model of an unknown structure 
using structural coordinates of one or more related protein molecules, molecular complexes 
or parts thereof (i.e., active sites). Homology modeling may be conducted by fitting common 
5 or homologous portions of the protein whose three dimensional structure is to be solved to 
the three dimensional structure of homologous structural elements in the molecule of known 
three-dimensional structure, specifically using the relevant (i.e., homologous) structural 
coordinates provided by Table 2. Homology can be determined a variety of known methods, 
for example, using amino acid sequence identity, homologous secondary structure elements, 
10 and/or homologous tertiary folds. Tesmer et al. (1997) (8) and Druey and Kehrl (1997) (88) 
provide examples of multiple sequence alignments of RSG protein sequences. Homology 
modeling can include rebuilding part or all of a three dimensional structure with replacement 

o 

Molecular replacement analysis as adapted and applied to NMR structural data (as discussed 
J! 5 above) and homology modeling are techniques that are well known in the art which can be 
rU readily applied or adapted to the determination of the three dimensional structures of other 

Ul proteins of the RGS family (and portions thereof, e.g., Ga binding sites and/or allosteric 

^ ! binding sites). These methods are particularly useful for determining RGS solution structure 

O within the conserved region of the protein based on the RGS4 three-dimensional solution 

p^O structure. These methods are applicable to presently known members of the RGS family of 
j£i proteins as well as to proteins, particularly those of RGS sub-family B, as yet unidentified as 

y t RGS proteins, and particularly those that exhibit significant sequence identity above 60% or 

more, preferably 85% or more sequence identity in the RGS-core region. NMR assignments, 
structural coordinates and three-dimensional structures of RGS proteins or peptides, 
25 determined using molecular replacement analysis and homology modeling based on the 

structural coordinated and NMR assignments provided herein and optionally refined using a 
number of techniques well known in the art, can be employed in a similar fashion to the 
structural coordinates of Table 2 for identifying, selecting or designing chemical species that 
are antagonists or agonists of RGS, Ga or RGS Ga complexes. 
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Description of the Structure of RGS4 

Theprimary amino acid sequence of several RGS4 proteins are known. Theamino 
acid sequence of RGS4-core (from rat) with attached hexahis pro tail is listed in Fig. 1 (as 
SEQ ID No. 1). The regular secondary structure elements of free RGS4 were identified from 
a qualitative analysis of sequential and inter-strand NOEs, NH exchange rates, coupling 
constants and the ,3 Cct and t3 Cp secondary chemical shifts (47, 48). The sequential and 
medium NOEs were obtained from a qualitative analysis of the t5 N-edited NOESY and n C- 
edited NOESY spectra. 3 JHNa C0U P lm S constants were obtained from the relative intensity of 
Ha crosspeaks to the NH diagonal in the HNHA experiment (IS). Slowly exchanging NH 
protons were identified by recording an HSQC spectra two hours after exchanging an RGS4 
sample from H2O to D2O. These data, together with the deduced secondary structure 
' elements are summarized in Fig. 1 . 

The overall structure of RGS4 is composed of seven helical regions corresponding to 
residues 7-12 (a,); 17-36 (a*); 40-53 (a 3 ); 61-71 (a 4 ); 86-95 (a 5 ); 105-125 (a 6 ) and 128-132 
(a 7 ). A simple description of the RGS4 topology is that the protein consists of two pseudo 4- 
helix bundles with an up-down-up-down arrangement where helical region six is part of both 
bundles. An unusual feature of the RGS4 structure occurs in the second helical region. There 
is a one residue (H23) -90° bend in the helix which effectively divides this helical region 
into two separate helices (as described in the RGS4 x-ray structure (J)). This one residue 
bend was not obvious from the NMR analysis of the secondary structure data (Fig. 1) where it 
appears to be a continual helical stretch. The bend only became apparent during the structure 
refinement process. Some observable NOEs that contribute to the>bend at H23 occur between 
residues -L20, 121, and residues G26, L27, A29, and F30. The bend at H23 effectively allows 
for appropriate packing of these hydrophobic side-chains. 

Additional bends or turns occur throughout the RGS4 structure. Helical regions a x 
and ce 2 are connected by residue S 1 6 that adopts an extended conformation allowing these two 
helices to be essentially parallel. This is very similar to the turns connecting helical regions 
a 3 and a A and helical regions a 5 and <x 6 . Conversely, helical regions a 2 and a 3 are connected 
by Y38 that has a positive (}) torsion angle, suggesting a P type rum. The conformation of 
Y38 results in an angle between helical regions 04 and a 3 of -45°, which also represents a 
transition point between the two pseudo 4-helix bundles. The longest loop in the structure 
occurs between helical regions a 4 and cc 5 . This loop region is well ordered based on high 



order parameters (S 2 > 0.6). The low mobility for this loop results from interactions with 
helical regions a-, and a 6 . The observed bend between the longest helical region a 6 and the 
shortest helical region ct 7 is suggestive of a distortion in this helical segment to achieve an 
optimal packing interaction between helical regions a, and cc 7 . The end result of these local 
conformations on the overall topology of RGS4 is to create an elongated structure where the 
two pseudo 4-helix bundles are nearly perpendicular. The interface between these the two 
pseudo 4-helix bundles is predominately hydrophobic in nature (LI 7, 121, L27, F30, L34, 
W46, 147, IUO, Fl 1 1, L 11 3, Ml 14) consistent with the general packing of hydrophobic 
residues in the core of the protein with charged residues on the protein surface. 

As previously described, the primary biological function for RGS4 is to bind G ial and 
stimulate its intrinsic GTPase activity. Key residues in the RGS4 structure that are involved 
in the interaction of RGS4 with G Itt] correspond to RGS4 residues S39, E41, N42, L113, 
D117, £>llS,andR!2i thai form the binding pocket for T1S2 from G iKi . Similarly, N82 
from RGS4 binds into the G ittl active site interacting with residues Q204, S206 and E207 (8) 
of G ia . RGS4 mutational work support the functional importance of these residues in the 
binding and activity of RGS4 with G ial while identifying N82 to be critical in facilitating 
GTP hydrolysis (18-20). RGS4 residues S39, E41 and N42 are located in the N-terminal end 
of helical region a 3 while LI 13, Dl 17, S 1 18, and R121 are located directly opposite at the C- 
terminal end of helical region <x 6 . N82 is located approximately in the center of the structured 
loop region between helical regions a 4 and a 5 which is positioned relatively above the T182 
binding pocket on RGS4. 

Another feature of the RGS4 structure is the observation that residues M1-S4 and 
P134-H166 are completely disordered and dynamically flexible. Structure coordinates for 
these atoms are not included in Table 2. This is evident by the sharp line- widths and the 
minimal number of observable NOEs. The flexible nature of these residues are further 
supported by l5 N Tl, T2 and NOE measurements which indicate low order-parameters (S 2 < 
0.6) 

RGS4 Structure Determination 

The final 30 simulated annealing structures were calculated on the basis of 2871 
experimental NMR restraints consisting of 1 960 approximate interproton distance restraints, 
78 distance restraints for 39 backbone hydrogen bonds, 431 torsion angle restraints comprised 
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of 151 <(>, 154 v|/, 97 xi, and 29 X2 torsion angle restraints, 132 3 J NHa restraints and 136 Ca and 
134 Cp chemical shift restraints. Stereo specific assignments were obtained for 58 of the 125 
residues with |3-methylene protons, for the methyl groups of 3 of the 5 Val residues, and for 
the methyl groups of 9 of the 12 Leu residues. In addition, 7 out of the 8 Phe residues and 4 
out of the 5 Tyr residues were well defined making it possible to assign NOE restraints to 
only one of the pair of C5H and CsH protons and to assign a X2 torsion angle restraint. 
Comparison of the Free RGS4 NMR Structure with the RGS4 G ial Bound Structure 

Figs. 2 A and 2B are ribbon diagrams of (A) the x-ray structure of RGS4 complexed to 
Gial (8) and (B) the solution structure of RGS4 as determined by NMR methods. Residues 
that effect significant structural change between the two structures are indicated. An 
unexpected result from determining the solution structure of RGS4 in the absence of G ia! was 
the observation of a significant change in the conformation for free RGS4 relative to RGS4 in 
the complex (5). A fundamental factor in the difference between the two structures is a 
perturbation in the secondary structure elements. Consistent with the RGS4-G iaI x-ray 
structure, the NMR structure of free RGS4 is an a-helical protein comprised of two peudo 4- 
helix bundles. The NMR data shows that free RGS4 is composed of seven helical regions and 
a majority of this data is consistent with the RGS4-G ia , x-ray structure. The significant 
difference between the two secondary structures occurs within the C-terminal helical regions 
a 6 and a 7 . In the RGS4-G ial x-ray structure residues V5 to T132 are observed in (i.e., they 
are ordered) and residues 104-1 16 and 1 19-129 are helical. This contrasts with the free RGS4 
NMR structure where residues 5-133, 105-125 (a 6 ) and 128-132 (a 7 ) are helical. There is a 
significant shift in the helical structure in this region containing residues 104-133. 

The observed structural change between the free RGS4 NMR structure and the RGS4- 
G ial x-ray structure is a movement of a kink between helical regions a 6 and ct 7 towards the C- 
terminus. The movement of this kink results in ct 6 being longer by nine residues and ct 7 being 
shorter by six residues in the free RGS4 NMR structure. Additionally, ct 7 of free RGS4 
extends three residues beyond what was observed as a structural region in the RGS4-G ia , x- 
ray structure. 

The observed change in the secondary structure, although only involving a few C- 
terminal residues, has far-reaching effect, since it results in a significant modification in the 
overall fold for RGS4. This is evident from a 1 .94 A backbone rms difference between the 
RGS4-G iD[1 x-ray structure and the free RGS4 NMR structure for residues 5-134. The major 
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effect of the alteration in secondary structures is a reorganization of the packing of the N- 
terminal and"C-terminal helix as is evident from per-residue backbone atomic rms differences 
between the free RGS4 NMR structure and the bound RGS4 x-ray structure. Therefore, the 
accuracy of the secondary structure interpretation is important for proper analysis of the free 
RGS4 structure. The reliability of the NMR secondary structure is demonstrated by the 
extensive data summarized in Fig. 1. Residues 105-125 and 123-132 show a continual stretch 
of NMR data consistent with an a-helical definition with an abrupt break in this information 
for residues 125-128. Furthermore, the significant differences between the - and C-terminal 
regions of the free RGS4 NMR structure and the bound RGS4 x-ray structure are indicated by 
a large number of interproton distance (145) and torsion angle (39) violations and by the 
corresponding very high values for the NOE and torsion angle restraint energies exhibited by 
the bound RGS4 x-ray structure. The self-consistency of the NMR data using NOEs, 
coupling constants, NH exchange rates and secondary carbon chemical shirts and the large 
number of restraint violations with the bound structure, demonstrate the accuracy of the 
RGS4 NMR" structure provided. Comparisons of the free RGS4 structure of this invention 
with the structure of the RGS4-Gi<xl complex should then provide an accurate description of 
the conformational changes that occur in on RGS4 on binding to GiaL 
Relevance to Activity for the RGS4 Conformational Change 

RGS4 is involved in the regulation of the G ia , GTPase cycle having a modest affinity 
for GTP-Ga, but not binding to GDP-Ga. It is believed that the observed conformational 
changes for free RGS4 are related to modulating its affinity to G iaI to allow for perpetuation 
of the GTPase cycle. This role for the RGS4 conformational change is evident by the fact 
that the RGS4 Get binding site is the location of the G ia , induced structural perturbation. The 
pronounced kink between helical regions ct 6 and a 7 observed in the bound RGS4 x-ray 
structure occurs at residues Dl 17 and Si 18. RGS4 molecular surfaces for both the free 
RGS4 NMR structure and the RGS4-G jal x-ray structure in the vicinity of the G ia! T182 
binding pocket were calculated. A comparison of the two RGS4 molecular surfaces, shows 
that the G iat T182 binding pocket is larger and more accessible in the free RGS4 NMR 
structure. Also, in the RGS4-G ial x-ray structure there appears to be a molecular surface 
"wall" composed of the RGS4 sidechains from residues Dl 17, SI 18 and R121 which 
surround the G iaI T182 binding pocket. These residues form an important hydrogen-bonding 
network which is critical for the binding of RGS4 with G ial where Dl 17 forms a hydrogen 
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bond with Rl 21 and the backbone nitrogen of G iaI T182. The critical nature of these residues 
is further supported by mutagenesis. Alanine mutations of Dl 17 and R121 diminishes RGS4 
activity and binding to G iat . Since the helical kink at residues Dl 17 and SI 18 is less 
pronounced in the free RGS4 NMR structure and a disruption in the helix occurs instead 
between residues 1 25- 1 28, the sidechains for D 1 1 7, S 1 1 8 and R 1 2 1 are well beyond 
hydrogen-bonding distance. It is evident from the free RGS4 NMR structure that the network 
of sidechain interactions with G ial T 182 in the absence of G ial is not pre-formed. 

The observation that RGS4 undergoes a significant structural change in the presence 
of G ial where the focal point of this change occurs at key residues in the RGS4- G ja i interface 
creates a different explanation for the process of RGS4 activation of G ia i GTPase activity. 
This information suggests a two-stage process composed of a binding and locking step. 
Because the G ial T182 binding pocket is clearly more accessible in the free RGS4 NMR 
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locking step then results from the induced conformational change in the RGS4 structure 
where the pronounced kink in the helix between residues Dl 17 and SI 18 brings these 
residues into close contact with R121 and G iol T182 to form the hydrogen bonding network 
observed in the RGS4-G ia i x-ray structure. T 182 in the binding pocket induces the formation 
of a hydrogen bonding network and the resulting RGS4 conformational change as opposed to 
a pre-formed binding site suggested from the RGS4-G ja , x-ray structure. The release of RGS4 
from G ia i would then require the removal of G ial T182 from the RGS4 binding pocket which 
presumably occurs during GTP hydrolysis. This mechanism is consistent with a local 
perturbation in the vicinity of T 182 seen between the GDP- G iol (2) and RGS4- G ial x-ray 
structures where this localized movement appears to be sufficient to remove T182 from the 
RGS4 binding site and disrupt the hydrogen-bonding network resulting in dissociation of the 
complex. Comparison of the Tl 82 G ia , region between the RGS4- G iaI and the GDP-A1F 4 - 
G ia , x-ray structures (4) indicate that these two structures are essentially identical in this 
region of G iat . Since the GDP-A1F 4 -G iat structure corresponds to the active form of GDP- 
A1F 4 -G jal as well as the conformation that RGS4 preferentially binds, the similarity between 
these two structures is also consistent with the proposed mechanism for the activity of RGS4. 

The x-ray structure of RGS4 complexed with G ial in conjunction with other Ga 
conformers suggest that the role of RGS4 in stimulating Ga GTPase activity is accomplished 
by stabilizing the GTP hydrolysis transition state. The NMR structure of free RGS4 reported 



here expands this mechanism suggesting that the RGS4 induced conformation in the presence 
of G io , may -be related to its GTP-G ioK specificity which facilitates binding turnover that is 
critical for perpetuating the GTPase cycle. The described structural change in RGS4 provides 
an elegant mechanism for the observed binding selectivity between the various Get 
conformers despite the close similarity in these structures. 
Detection of an Allosteric binding site in RGS4 

Several small molecule inhibitors of the RGS4-Ga interaction were identified in a 
large scale screening based on detection of inhibition of Get GTPase function which implies 
inhibition of the binding of RGS4 to Ga. One of these compounds (designated compound 1 
for convenience) exhibited 100% inhibition of binding. The nature of the activity of 
compound 1 and its ability to inhibit RGS4 binding to Ga was further investigated by 'H- I5 N 
HSQC chemical shift perturbation experiments. A total of five compounds, three that had 
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in the screen were examined. 2D ! H- 15 N HSQC spectra were collected for a l5 N-enriched 
RGS4 sample and a series of 15 N-enriched RGS4 samples titrated with one of the three test 
compounds and two controls. Comparison of the HSQC spectra of a free RGS4 sample and 
each of the samples titrated with a potential inhibitor allowed the identification of any 
chemical shift changes for RGS4 in the presence of the test and control compounds. In such 
an analysis the observation of a change in the position shape or intensity of a resonance 
indicates perturbation. With the NMR instrumentation employed a shift of half a line width 
in peak position could be reliably detected. Only in NMR spectra taken of RGS4 in the 
presence of compound 1 were any chemical shift perturbations observed indicating that 
compound 1 directly binds to RGS4. Employing the chemical shift assignments for free 
RGS4 (Table 1), the binding site of compound 1 in RGS4 was identified. RGS4 amino acid 
residues V10, W13, 117, 120, H23, E24, C25 and T125 exhibited a chemical shift perturbation 
in the presence of compound 1. 

The observed chemical shift perturbations did not arise from a pH change caused by 
addition of compound 1 to the RGS4 solution. ('H-^N HSQC spectra of free RGS4 laken 
over a pH range of 5.5-6.5 indicate that none of the amino acid residues listed above was 
sensitive to pH changes over this range). The binding site for compound 1 corresponds to 
residues in the al-a2 region of RGS4 (where the al-a2 region includes the tight turn 
between the two helices). In the three-dimensional structure of RGS4, the binding region is 



positioned on the opposite surface from the Ga binding site. No amino acids residues 
associated with the Ga binding site exhibited any chemical shift perturbation in the presence 
of compound 1. This indicates that the structure of the RGS4 Ga binding site is unchanged 
in the presence of compound 1. Compound 1 was found to significantly decrease the 
expected GTPase activity of Ga which combined with the fact that compound 1 binds at a 
site distal from the Ga binding site indicates that compound 1 is an ailosteric inhibitor of 
RGS4 and that there is an ailosteric binding site in the al-a2 region of RGS4. 
It is believed that binding of compound 1 at the ailosteric binding site stabilizes RGS4 in the 
free form and effectively locks the RGS4 protein in free form. Compound 1 prevents the 
formation of a hydrogen-bonding network around the Ga T182 binding pocket. 

The solution structural information provided herein, including the secondary and 
tertiary structure of RGS4-core, the RGS4 Ga binding site, the a6 - a7 region, and the 
ailosteric binding site in the ai-a2 region of RGS4 can all be employed in methods described 
herein and methods well known in the art to identify, select or design candidate agonists and 
antagonists of RGS4 activity which in turn affects G-protein signaling functions in various 
eukaryotic cells and organisms. 

The following examples are provided to further illustrate the invention and are not 
intended to limit the invention. 

EXAMPLES 
The following abbreviations are used herein: 

G-proteins, heterotrimeric guanine nucleotide-binding proteins; RGS4, Regulators of 
G-protein Signaling; G ial , Ga subunit of heterotrimeric G proteins, G ial -AIF 4 ~, Ga subunit of 
heterotrimeric G proteins complexed with Mg 2+ , GDP and A1F 4 * stabilized in the transition 
state for GTP hydrolysis, DTT, DL-l,4-DithiothreitoI; GTP, guanosine triphosphate; GDP, 
guanosine diphosphate; NMR, nuclear magnetic resonance; 2D, two-dimensional; 3D, three- 
dimensional; HSQC, heteronuclear single-quantum coherence spectroscopy; HMQC, 
heteronuclear multiple-quantum coherence spectroscopy; TPPI, time-proportional phase 
incrementation; NOE, nuclear Overhauser effect; NOESY, nuclear Overhauser enhanced 
spectroscopy; COSY, correlated spectroscopy; HNHA, amide proton to nitrogen to CaH 
proton correlation; HNHB, amide proton to nitrogen to CpH proton correlation; CT-HCACO, 
constant time CaH proton to a-carbon to carbonyl correlation; HACAHB, CaH proton to a- 
carbon to CpH proton correlation. 
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Example 1 : Assignment of NMR Peaks and Secondary Structure Determination 

The RGS core domain of RGS4 was expressed in Escherichia coli (J 1 09) using the 
prokaryotic expression vector pQE50 (Qiagin, Valencia, CA). PCR was used to amplify and 
add' a C-terminal hexahis -pro tag to the RGS core (here residues 51-206 of RGS4) and the 
product was ligated between the BamHl and Sail sites of pQE50 to give plasmid pRGS4. 
E. coli (BL21(DE3)) containing pRGS4 were grown in LB broth supplemented with 100 
ug/mL ampicillin. An overnight culture was diluted 1 :20 and grown at 37°C to an A^ of 0.6 
- 0.8 with vigorous shaking. Isopropyl P-D-galactoside (1PTG) was added to a final 
concentration of 1 rruVl and cultures were shaken for 3 h at 37 °C. The cells were harvested 
by centrifugation (7000 x g) for 15 min. at 4°C, washed with PBS and stored at -70°C. 

Uniformly (>95%) and ^C-labeled recombinant RGS4-core (containing the 166 
amino acid core domain of RGS4 with an N-terminal methionine and C-terminal hexahis-pro 
tag) was obtained by growing BL2I (DE3) £. coli in defined medium containing 2.0 g/L 
[ U C6, 98% +] D-glucose and 1.0 g/L [ I5 N, 98%+] ammonium chloride as sole carbon and 
nitrogen sources, respectively. In addition, the defined medium contained M9 salts, trace 
elements, vitamins and 100 Jig/L ampicillin. Conditions for induction and growth are as 
described above. The recombinant RGS4-core protein was purified using affinity 
chromatography on a 10 mL Ni 2+ column and purified to homogeneity following ion- 
exchange chromatography on Resource S at pH 5.5. Protein was desalted into appropriate 
buffer prior to use. N-terminal amino acid sequencing was performed to confirm protein 
identity and uniform labeling of RGS4-core was confirmed by MALDI-TO mass 
spectrometry (Perceptive Biosystems). 

The NMR samples contained 1 mM of RGS4 manually purified-core protein in a 
buffer containing 50mM K 2 P0 4 , 2mM NaN 3) and 50 mM deuterated DTT, in either 90% 
H2O/ 10% D2O or 100% D2O at pH 6.0. 

All spectra were recorded at 30 - 35° C on a Brucker AMX-2 600 spectrometer using 
a gradient enhanced triple-resonance 'H/^C/^N probe. For spectra recorded in H 2 0, water 
suppression was achieved with the WATERGATE sequence and water- flip back pulses (23, 
24). Quadrature detection in the indirectly detected dimensions were recorded with States- 
TPPI hypercomplex phase increment (25). Spectra were collected with appropriate refocusing 
delays to allow for 0,0 or -90,1 80 phase correction. Spectra were processed using the 
NMRPipe software package (28) and analyzed with PfPP (29), NMR Pipe and in a peak 
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sorting program— on a Sun Ultra 10 Workstation. When appropriate, data processing included 
a solvent filter, zero-padding data to a power of two, linear predicting back one data point of 
indirectly acquired data to obtain zero phase corrections, linear prediction of additional points 
for the indirectly acquired dimensions to increase resolution. linear prediction by the means 
of the mirror image technique was used only for constant-time experiments (38). In all cases 
data was processed with a skewed sine-bell apodization function and one zero-filling was 
used in all dimensions. 

The assignments of the 'H, l5 N, l3 CO, and 13 C resonances were based on the following 
experiments: CBCA(CO)NH (62), CBCANH (63), C(CO)NH (64), HC(CO)NH (64), 
HBHA(CO)NH (65), HNCO (66), HCACO (29), HNHA (26), HNCA (67), HCCH-COSY 
(68) and HCCH-TOCSY (69) (for reviews see: Bax et al 1994 and Clore and Gronenborm, 
1994). The resonance assignments of RGS4 essentially followed the semi-automated 
protocol described previously (37, 70, 71). The accuracy of RGS4-core assignment was 
further confirmed by sequential NOEs in the 15 N-edited NOESY-HMQC spectra. Because 
the RGS4 structure is exclusively ct-helical, the sequential NH r NH iTl NOEs were extremely 
useful in completing the RGS4 backbone assignments. l H, 15 N, l3 C AND 13 CO assignments 
for RGS4-core are summarized in Table 1 . 

The backbone 1 H, 15 N, ,3 CO, and i3 C assignments in Table 1 are essentially complete 
for the RGS4-core. As noted above, the native core sequence was appended to six histidines. 
The last five histidines were the only unassigned residues in the protein. The ability to obtain 
the complete assignments for RGS4-core implies a well-packed ordered structure. The side- 
chain assignments are also nearly complete; the majority of missing information is in residues 
with long side-chains which are potentially solvent exposed. 

The secondary structure of the RGS4-core (summarized in Fig. 1) is based on 
characteristic NOE data involving the NH, Ha and Hp protons from 15 N-edited NOESY- 
HMQC and i3 C-edited NOESY-HMQC spectra, 3J HNa coupling constants from HNHA, 
slowly exchanging NH protons and I3 Ccc and 13 CP secondary chemical shifts (for reviews see: 
(56) and (78). It was determined that the RGS4-core solution NMR was composed of seven 
helical regions corresponding to residues 7-l2(ccl);l7-36(cc2); 40-53(a3); 6l-71(a4); 86-95 
(a5); 105-125 (a6); and 128-132 (a7). The RGS4-core overall fold is essentially comprised 
of two 4-helix bundles with the long helical region oc6 part of both bundles.. A distinct 
difference in the RGS4-core secondary structure in solution from the x-ray structure of the 
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RGS4-Giccl complex was unexpectedly observed at the C-terminus. The x-ray structure 
indicates that residues 104-1 16 and 1 19-129 are helical where only residues V5 to Tl-32 are 
observed. The solution NMR structure indicates that residues 105-125 and 128-132 are 
helical and residues P134-H166 appear, in view of the sharp line-widths observed, to be 
extremely mobile. The differences in secondary structure between the x-ray crystal structure 
and that of free RGS4-core suggest a conformational change in RGS4 on binding to Got. 

Example 2: Three-Dimensional Structure Determination for RGS4-core 

RGS4-core was prepared, purified and uniformly labeled as in Example 1 . NMR 
samples were prepared and spectral data accumulated as indicated in Example 1. 

The RGS4 structure is based on the following series of spectra: HNHA (26), HNHB 
(27), 3D long-range 13 C- 13 C correlation (28), coupled CT-HCACO (29, 30), HACAHB- 
COSY (31), 3D **N- (32, 33) and ^C-zdittd NOESY (35, 37) experiments. The ^N-edited 
NOESY, and ^C-edited NOESY experiments were collected with 100 msec and 120 msec 
and mixing times, respectively. 

Spectra were processed using the NMRPipe software package (36) and analyzed with 
PIPP (37) on a Sun UltralO Workstation. When appropriate, data processing included a 
solvent filter, zero-padding data to a power of two, linear predicting back one data point of 
indirectly acquired data to obtain zero phase corrections, linear prediction of additional points 
for the indirectly acquired dimensions to increase resolution. Linear prediction by the means 
of the mirror image technique was used only for constant-time experiments (38). In all cases, 
data were processed with a skewed sine-bell apodization function and one. zero-filling was 
used in all dimensions. 
Interproton Distance Restraints. 

The NOEs assigned from 3D I3 C-edited NOESY and 3D 15 N-edited NOESY 
experiments were classified into strong, medium, weak and very weak corresponding to 
interproton distance restraints of 1.8-2.7 A (1.8-2.9 A for NOEs involving NH protons), 1.8- 
3.3 A (1.8-3.5 A for NOEs involving NH protons), 1.8-5.0 A, and 3.0-6.0 A, respectively (39, 
40). Upper distance limits for distances involving methyl protons and non-stereospecifically 
assigned methylene protons were corrected appropriately for center averaging (41). 
Torsion Angle Restraints and Stereospecific Assignments, 

The p-methylene stereospecific assignments and x\ torsion angle restraints were 
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obtained primarily from a qualitative estimate of the magnitude of 3 J a p coupling 
constants from the HACAHB-COSY experiment (31) and 3 Jnr coupling constants from the 
HNHB experiment (27). Further support for the assignments was obtained from approximate 
distance restraints for intraresidue NOEs involving NH, CctH, and CpH protons (42). 
5 The § and y torsion angle restraints were obtained from 3 JNHa coupling constants 

measured from the relative intensity of Ha crosspeaks to the NH diagonal in the HNHA 
experiment (26), from chemical shift analysis using the TALOS program (43) and from 
consistency with distance restraints for intraresidue and sequential NOEs involving NH, 
CctH, and CpH protons. 1 JcaHa coupling constants obtained from a coupled 3D CT- 
10 HCACO spectrum were used to ascertain the presence of non-glycine residues with positive § 
backbone torsion angles (30). The presence of a ! JcaHa coupling constant greater then 130 
Hz allowed for a minimum, (j) restraint of -2° to -178°. 

The He and Leu X2 torsion angle restraints and the stereospecific assignments for 
leucine methyl groups were determined from 3 JcaCS coupling constants obtained from the 
I=yl5 relative intensity of Ca and CS cross peaks in a 3D long-range 13 C- 13 C NMR correlation 
Jz: spectrum (44), in conjunction with the relative intensities of intraresidue NOEs (45). 

h \ 

Wl Stereospecific assignments for valine methyl groups were determined based on the relative 

e 

p intensity of intraresidue NH-CyH and CaH-CyH NOEs as described by Zuiderweg et al. 

y*! (1985) (46). The minimum ranges employed for the tj>, \\t, and x torsion angle restraints were 

fli 

C? 20 ± 30°, ± 50°, and ± 20° respectively (47). 
y : Structure Calculations 

The structures were calculated using the hybrid distance geometry-dynamical 
simulated annealing method of Nilges et al. (1988) (48) with minor modifications (49) using 
the program XPLOR (50), adapted to incorporate pseudopotentials for 2 J^ Ha coupling 
25 constants (5 1), secondary 13 Ca/ u CfJ chemical shift restraints (52) and a conformational 
database potential (53, 54). The target function that is minimized during restrained 
minimization and simulated annealing comprises only quadratic harmonic terms for covalent 
geometry, 3 J NHa coupling constants and secondary ,3 Cct/ n Cp chemical shift restraints, square- 
well quadratic potentials for the experimental distance and torsion angle restraints, and a 
30 quartic van der Waals term for non-bonded contacts. All peptide bonds were constrained to 
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be planar and trans. There were no hydrogen-bonding, electrostatic, or 6-12 Lennard- Jones 
empirical potential energy terms in the target function. 
Analysis of a T-182 Binding Site on RGS4-core. 

' The overall appearance of the NMR structure in the area of the proposed Tl 82 (of 
Get) binding site is one of great interest. To obtain a more quantitative measurement of the 
differences in accessibility between the free RGS4 NMR structure and the x-ray structure of 
the RGS4-G ial complex, MOLCAD (commercially available from TRJPOST) surfaces were 
calculated for both structures and the surface area of each was measured. 

The x-ray structure of the RGS4-G ial complex (AGR1) was read into SYBYL 
(Tripos) and all substructures except chain E (RGS4) were deleted. Additionally, all waters 
were deleted. Polar hydrogens were added and optimized using the Kollman United Atom 
force field. This was followed by addition of all the remaining hydrogens. MOLCAD was 
then used to generate a surface for all residues thought to be involved in binding of T182 
(Ga-bindmg site). These RGS4-core residues include, 121, 127, F30, F33, L34, E37, S39, 
N42, 143, W46, 1110, LI 13, Ml 14, Dl 17, SI 18, R121. The surface area was calculated 
based on the MOLCAD surface. MOLCAD was also used to calculate the surface area for 
the identical residues of the free RGS4 NMR structure. The surface area for the free RGS4 
NMR structure was calculated to be 404.56 A 2 . The surface area for the crystal structure was 
calculated to be 321.88 A 2 . The difference in surface area of 82.67 A 2 , is an approximate 20% 
change in surface area between the two structures. A MOLCAD surface generated on the 
methyl and hydroxyl groups of Tl 82 of Ga has a surface area of 57.72 A 2 . 

Example 3: Identification of an Allosteric Binding Site in RGS4-Core 
Bead Precipitation Assay for Inhibition of RGS Binding to Ga 

Radiol abled [ 35 S]- Gail was synthesized in a rabbit reticulocyte lysate in vitro 
translation reaaction (Promega, Madison, WI Cat. NO. 14960) programmed with in vitro 
transcribed cRNA preparations (Promega, Cat. No. PI 290). Affinity-purified GST-RGS4 
core (lOOng, about 25 nM final concentration) is incubated with 17.5 uL glutathione- 
Sepharose 4B bead (Amersham Pharmacia, Piscataway, NJ, Cat. NO. 17-0756-01) slurry in 
lOOuL binding buffer (IX PBS, ImM MgCl 2 , ImM DTT, 1% BSA) in a 96-well microtube 
assay plate. Approximately 300uM of test compound (about O.lmg/mL final concentration, 
either as a mixture or individual compound) is added to each well and incubated at 4°C for 30 
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min. Approximately 50,000-100,000 cpm (?) 35 S]-Gctil in lOOuL assay buffer (IX PBS, 
ImM MgCl 2 7 lOuM GDP, IrruM DTT, 30 uM A1C1 3 , 1% BSA, 500 uM NaF) is added to the 
reaction and incubated at 4°C for 30 min. The resulting assay sample has a final 
concentration of about 1-3 nM activated Gail. Reaction plates were centrifuged at lOOOxg 
for 3 min, and the supernatant aspirated. Beads were washed 2x by resuspension in 200 uL 
binding buffer followed by centrifugation. Bound [ 33 S]-Gccilis eluted from the bead pellets 
by resuspending them in lOOuL 1% SDS. Elutates are either counted in 4 mL scintillation 
fluid or subjected to gel electrophoresis. Random small molecules can be evaluated in the 
assay described using a compressed library wherein a plurality of test compounds are 
combined in a single well (e.g., 10 compounds/well for 3000 primary assays tests 30,000 test 
compounds). Mixtures of test compounds that exhibited a greater than 50% decrease in 
precipitated radioactivity were confirmed by re-screening in an identical format. Combi- 
wells (here 10 test compourids/weu) thai tested positive in both assays were deconvolved and 
the individual compounds were tested individually in an identical bead precipitation assay. 
Compounds that demonstrated the requisite decrease (about 50% or more) in precipitated 
radioactivity were further tested to confirm that the decrease in precipitated radioactivity was 
dependent on the RGS4-Ga interaction and not due to spurious activity of the test compound. 
In these cases, the assay precipitate was analyzed by gel electrophoresis to confirm the 
presence of RGS4 in the precipitate. 
! H- 15 NHSOC Chemical Shift Perturbation 

The RGS4 NMR samples contained 0.3mM of RGS4-core protein in a sample buffer 
(50mM KP0 4 , 2mM NaN 3 , and 50mM deuterated DTT in 90% H 2 O/10%D 2 O at pH 6.0). 
Test compounds were added to the sample in 10-fold molar excess. 2D 1H-15N HSQC 
spectra for free RGS4 and RGS4 in the presence of test compounds were collected over a pH 
titration range of 5.5-6.5. The spectral width in the indirectly detected 15 N dimension was 
30.00 ppm with the carrier position at 1 19. 1 ppm. Spectral width in the acquisition 
dimension was 13.44 ppm with the carrier at the water frequency (4.73 ppm). The number of 
points acquired in the two dimensions was 256 complex in F1( I5 N) and 1024 real in ri2('H). 
AH spectra were recorded at 35°C on a Brucker AMX-2 600 spectrometer using a gradient 
enhanced triple resonance ! H/ l3 C/ I5 N probe. Water suppression was achieved in the 
indirectly detected dimension with the WATERGATE sequence and water-flip back pulses 
(23, 24). Quadrature detection in the indirectly detected dimensions were recorded with 
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States-TPPI hypercomplex phase increment (25). Spectra were collected with appropriate 
refocusing delays to allow for 0,0 phase correction, processed using the NMRPipe software 
package (36)) and analyzed with PIPP (37) on a Sun Ultra 10 Workstation. Data processing 
included a solvent filter, a skewed sine-bell apodization function and one zero- filing in all 
dimensions. 

GTPase Functional Assay A single -turn over GTP-ase assay of G-protein a subunits was 
used. In this assay GTPase- induced hydrolysis of [y- 32 P]-GTP results in precipitation of 
radiolabel as 32 Pi. Unhydrolyzed [y- 32 P]-GTP is separated from precipitated label which is 
then counted. Precipitated label 32 Pi is directly proportional to the amount of [y- 32 P]-GTP 
hydrolyzed and to the activity of the Get GTPase . 

Purified [y-32P]-GTP bound Ga is prepared by incubating Ga (2uM) with [y-32P]- 
GTP (2aM) in a reaction buffer (total volume 30 uL), 1 OmM Hepes (pH 8.0), 5 mM EDTA, 
2nuvl DTT, 0.05% C12E10 (Lubrol, ICN Biomedicals, Inc., Aurora, OH), 10 ug /rnL BSA) 
for 30 min at 30° C. Unbound [y- 32 P]GTP is removed using a gel filtration column (Centri- 
Sep, Princeton Separations, Princeton, NJ) according to the manufacturers directions. The 
eluate containing [y- 32 P]GTP bound Ga is collected and the protein is recovered (typically up 
to about SO-90%) after centrifugation at 2000rpm for 2 min at 4° C. 

All steps of the assay are performed at 4° C The purified [y- 32 P]GTP bound Ga 
obtained above is added to 500 f^L of reaction buffer (as above) and separated in to eight 
50uL samples (a zero time control (no initiation) and seven assay time points). The reaction 
is initiated by adding lOuL of 1M MgCl 2 and 10uX of lOmM GTP to the seven assay 
samples. After 10, 20, 30, 40, 60, 90, and 120 seconds, respectively, 750 uL of stop buffer 
(50 mM NaP0 4 (pH 3.0), 5% activated charcoal) is added to one of the assay samples. The 
control and samples are then centrifuged at 100,000 rpm for 10 min to precipitate the charcoal 
and 500 uL of supernatant is remove to assay radiolabel present. GTPase activity is 
expressed as the amount of free [ 32 P]-phosphate released from [y- 32 P]GTP. Phosphate release 
(frnol) = radioactivity (zero time control- time assay) (counts)/specific activity of [y- 
J2 P]GTP. 

GTPase activity of Gai in the presence of RGS4-GST fusion protein was determined 
as described above where GTP hydrolysis by 100 nM Gai was initiated by the addition of 
MgCl 2 in the presence and absence of lOOnM RGS4-GST protein. GTP hydrolysis at the 
indicated time points was calculated as the amount of 32 Pi released (in frnol). The dose- 



dependent effect of RGS4-GST protein on the hydrolysis of GTP-Gai was measured as 
described above in the presence or absence of 10 nM or lOOnM RGS4-GST protein. " 

The effects of test compounds are evaluated for modification of the activity of the 
RGS4 core domain. The RGS4 core protein was generated as a GST-RGS4core fusion using 
standard molecular techniques. Briefly, the core region of RGS4 was obtained using PCR to 
generate a cDNA fragment encoding amino acid 51 (val) to the C-terminal end of the 
protein, amino acid 206 (ala). The 5' forward amplification primer contained an embedded 
BamHI restriction site, followed by nucleotides encoding a flexible linker, Gly-Ser-GIy-Ser, 
prior to the Val residue of rat RGS4. The 3' reverse amplification primer contained a stop 
codon, followed by an embedded BamHI site. • The amplimers were used with pWE2RGS4 
(Shuey et al., 1998 (84) ) as template to generate a PCR product of approximately 625 base 
pairs. This PCR product was BamHI digested, purified and ligated in the BamHI site of 
pGEX-2T (Ammersham Pharmacia, Piscataway NT; to generate pGST-RGS4c recombinant 
plasmid. Plasmid was tranfected into bacterial cells, and DNA prepared by standard methods, 
and confirmed by sequence analysis. GST-RGS4c fusion protein was generated and 
purified according to manufacturer suggestions for expression using the pGEX-2T vector. 

To measure the effect of test compounds on the activity of RGS4 core domain, RGS4- 
GST fusion protein (1.6 \iM) is incubated with test compound (or mixtures of test 
compounds) (30uM-40uM each) or DMSO for 1 hr at 30°C Thereafter, GTPase activity of 
Gai (lOOnM) is measured in the presence or absence of the RGS4-GST treated with the test 
compound (lOOnM). Each assay is replicated at least three-times. RGS4-GST was treated 
with Compound 1 at 30 and inhibited the GTPase activity of IIGS by 30%; while RGS4- 
GST treated with Compound 1 at 300 uM inhibited GTPase activity in comparison to a 
DMSO control. 

Those of ordinary skill in the art will appreciate that reagents, methods, procedures 
and techniques other than those specifically disclosed herein are known in the art and can be 
readily employed or adapted to the practice of this invention to achieve the results of this 
invention. All such art-known functionally equivalent reagents, methods, procedures and 
techniques are intended to be encompassed by this invention. All references cited herein are 
incorporated by reference herein in their entirety to the extent that they are not inconsistent 
with the disclosure herein. 



Table 1 I5 N, l3 C, ^CO and ! H resonance assignments for RGS4 at pH 6.0 and 30^C a 



Residue N 


CO 


Ca 




Others 


Ml 


-(-) 


I /0. 1 


q fA ia\ 
jj.y (4.jU; 




f 3->.y (/.JOJA-E 11. 1 (0.3O) 


R2 


123.0 (8.34) 


t TT 1 
1 / 1.1 


jo. / (4.zy) 


ii n r i (J/t i ta 
33.0 (1.84,1. / /) 


CyZ4.S Vl.45),CwO 2y.2 ( 1.7 !) ,(-£ 42.2 (3.00) 


G3 


1 1 t\ a to a r\\ 

1 10.4 (8.40) 


1 7"* 7 


-l< -3 (a nn\ 






S4 


t t c "i to n\ 

1 15.7 (8.23) 


X /t.u 


JO. I ^H.J^J 






V5 


121.6 (8.20) 


1 /4.V 


£ a i //i io\ 


11 £ f 1 QO^ 


*~y n. f ^u.yo;,zi.y i.u.oy; 


S6 


122.3 (8o6) 


1 / J. 1 




1 (A 11 d. (Y)\ 




Q? 


121.5 (8.95) 


178.2 


58.5 (3.80) 


28.3 (2.02) 


Cy34.3 (2.39) 


E8 


118.9 (8.48) 


1 1 O O 

178.8 


59.7 (3.85) 


29. 1 (2.0 1, 1. 94) 


Cy 3o,o (Z.29) 


E9 


120.5 (7.43) 


177.0 


Do.y ^.77) 


29.4 (2.04,1.72) 


Cy 36. 4 (2.23) 




116.8 (7.16) 


180.8 


65.0 (3.89) 


31.4(1.54) 


Cy22.1 (0.34) ;22. 6 (0.34) 




122.0(7.86) 


179.6 


59.9 (3.92) 


32.1 (1.82) 


Cy 2^.4 (1.52,1.37);C5 29.4 (1.62); 


S3! 










Cs42.1 (2.91,2.79) 


PJ 


120.0(7.39) 


180.4 


59.1 (4.09) 


31.7 (2.12,1.93) 


Cy 25,4 (1.52,1.45); CS 29.2 (1.68); 










Ce 42.0 (2.99) 


&|W13 


120.7 (7.84) 


176.7 


57.2 (4.63) 


29.8 (3.46) 


C51 126.4 (6.67); Nel 129.3(9.69); " 












C^2 113.9(6.25); Crj2 124.0(6.81); 


s 










C^3 121.3(7.18) 


DA14 


1 17.5 (7.48) 


176.9 


52.3 (4.36) 


18.6(1.52) 




C&E15 


117.5 (7.80). 


177.1 


57.6 (4.29) 


30.9(2.16) 


Cy36.6 (2.48,2.25) 


a;si6 


111.6(7.02) 


173.6 


57.5 (4.50) 


65.0 (4.22,3.93) 




€>L17 


125.4 (8.36) 


177.5 


57.1 (3.17) 


39.5 (0.97) 


Cy 26.2 (0.65);C5 25.9 (-0.48);23.9 (-0.08) 


OE18 


117.2 (8.68) 


177.7 


60.1 (3.63) 


29.3(1.98,1.91) 


Cy 36.5 (2.19) 


HN19 


1 16.0 (7.53) 


176.9 


55.3 (4.37) 


38.4 (3,20,2.78) 




L20 


119.5(7.01) 


176.4 


58.2 (3.50) 


41.8 (1.79,1.63) 


Cy 24.7 (0.52);C5 26.9 (-0.03);24.9 (0.33) 


121 


105.6 (7.37) 


175.1 


63.3 (3.65) 


37.1 (1.61) 


Cym 1 9.2(0.3 3) ;Cy (0.70,0.8 1 ) ; 
C5 12.3(-0.62) 


N22 


115.7 (7.10) 


173.8 


53.9 (4.76) 


39.5 (2.88) 




H23 


121.6 (7.32) 


176.0 


56.4 (4.71) 


33.9 (3.31,2.98) 


C62 138.6 (8.15) 


E24 


128.2 (9.04) 


179.5 


60.6 (4.07) 


29.8 (2.09) 


Cy36.1 (2.44,2.34) 


C25 


121.4(10.57) 


176.9 


61.8 (4.46) 


27.3 (3.37) 


y 


G26 


114.1 (7.36) 




46.6 (2.86,2 


.45) 


/ 


L27 


122.8 (8.66) 


178.3 


58.2 (3.76) 


41.4(1.72,1.65) 


Cy27.1 (1.60);C525.6 (1.05); 23.5 (1,04) 


A28 


118.3 (7.29) 


180.8 


55.3 (4.06) 


18.4(1.60) 




A29 


122.8 (7.85) 


178.9 


55.3 (4.51) 


18.2(1.84) 
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F30 


120.0 (8.51) 


17^ 


59.1 (3.94) 


39.5 (2.62,2.38) 


C5 - (634) 




K31 


118.9 (9.02) 


177.5 


60.6 (3.59) 


32.4(1.94,1.74) 


Cy 26.0 (1.70,1.44);C5 - (1.63);Ce - (2.96) 




A32 


* 121.6(7.82.) 


180.3 


55.3 (3.98) 


17.8(1.61) 






F33 


121.5 (7.82) 


177.8 


59.9 (4.39) 


39.2 (3.18,2.00) 


C5-(6/78);Ce-(7.18) 




L34 


122.0 (8.57) 


180.0 


57.6 (3.34) 


40.2 (0.88) 


Cy26.2(1.01);C5 21.8 (0.57); 26.3 (0.19) 




K35 


120 9 (8 60) 


180.1 


59.6 (3.90) 


32.1 (1.79) 


Cy 24.9 ( 1 .54, 1 .39);Ce - (2.93) 




S36 


116.7 (7.43) 


174.1 


61.1 (4.09) 


63.1 (4.04,2.38) 






E37 


1 18 0 (6.86) 


175.3 


55.3 (4.28) 


29.9 (1.98,1.45) 


Cy 35. 9 (1.66) 




Y18 


118 8 (7 60) 


175.8 


58.7 (4.37) 


35.5 (3.27,3.09) 


C5 133.4 (7.04);C£ 1 18.2 (6.84) 




S19 


1 13 0 (8 05) 


175.5 


57 0 (5 04) 


64.5 (3-83,3.71) 






F40 


122 2 (9 07"! 


175 9 


58 2 (3 86) 


30.2 (1.96,1.75) 


Cy36.9 (2.20) 




T^d. 1 
C^f 1 


179 -x no 7?} 


177 7 


Uv.7 ^.-J/ 


27 7 (i 13 1 98} 


Cy35 6 (2 37) 




N47 


1 1 6 4 (1 61) 


176 9 


56 7 C4 5^) 


38.8 (2-68) 






143 


117.6 (6.99) 


176.2 


58.9 (4.52) 


38.3 (1.58) 


Cym 20.7 (1.05); C5 14.6 (0.92) 




D44 


126.2 (8.38) 


179.6 


57.6 (4.59) 


39.1 (2.85) 






F45 


123.8 (8.40) 


176.5 


60.2 (4.37) 


39.0 (2.98,2.50) 


C5 - (6.67) 




W46 


122 3 (1 98) 


177.7 


64.9 (4.39) 


29.9 (3.78,3.21) 


Nel 130.7 (10.68);C^2 115.1(7.25); 
Ct(2 125.8 (7.01);CC3 122.9(7.01); 
Ce3 1 19.9 (7.35) 




14/ 


1 1 R "7 (Q fldl 


17R A. 


oj. l V- 3 * J ' / 


18 6 f"v f)7~i 


fArm 17 0 (0 v5VOyz9 4 1 ^ 24 i.46j; 












CS 14 3 n 07} 


£i 




117 4/7011 


176 6 


62 '8 T4 27} 


- (3 89 3 67} 




C49 


120 4 f7 53} 


179.6 


64.0 (3.90) 


- (3-61) 




P i 


FSO 


120 1 n 39} 


179 1 

X / J m X 


58 8 (3 58) 


29.0 (1.65) 


Cy35.2(-) 




F51 
j-j j i 


1 1 8 3 f 7 93^ 


180.0 


59 0 (3 84) 


29.7 (2.21,1.95) 


Cy36.3 (2.15) 


iF! 

y i 

m 


Y52 


123 0 C8 in 


175 6 


61 2 (3 95) 


38.7 (3.30,2.95) 


C5-(7.03);Ce 118.5 (7.15) 


K53 


1 12 6 (7 43 1 ) 


111 A 


58 2 (3 89) 


32.2 (1.86,1.76) 


Cy -(1^4,1.39) 




K54 


117 S f7 1 R1 


176 0 


S6 1 C4 17} 
JO. 1 y-r. xl) 


11 1 H 98 1 71} 


Cv7S 4 M-C5 29 4 ( 1 60VC£ 42.2 (-) 


P 155 


171 1 f7 19} 


176 8 


64 0 f3 58} 


377 fi 741 


Cym 19 4 (0 75VCY23.3 (0.35) 




K56 


1T£ S f R ~i01 


17£ 7 




11 6 n 9S 1 77} 


r-v74 6 n 40} C5 28 7 (I 64}*Cs 42.1 (3.00) 


m 


S57 


116 7 f7 1 SI 
1 1D.Z, ^ I .j J) 




S4 6 fd 98} 


64 0 (3 19) 




»£i P58 


- f-1 

\ ) 


178 8 


6S 0 f 4 19} 

UJ.U 1-7 J 


32 1 ^2 45 2 06} 


Cy27.5 (2.17,2.06);C8 51.4 (4.11,3.98) 


U 


S59 


1 17 1 1 SI 


17S 5 


60 5 ^4 32} 


62 7 f 3 90} 






K60 


n q 7 n its 


1 "7*7 A 
1 / / .0 




11 7 /7 01 1 89} 


r v 75 7 M-CS 29 0 f-}*C£ 42 4 (-} 




L61 


120.4 (7.49) 


178.5 


COT//* 

58/7 (4.1Z) 


41. J ( l.oZ,!./ lj 


f*-vl6 R ri ^71-^8 7S 0 (CS 741-71 2 (0 35) 




S62 


1 12.7 (8.67) 




64.0 (4.38) 


60.0 (4.Ut5) 






P63 


- \r) 


t rh n 




10 8 <"7 42 1 95} 

J\J.Q ^I.'rijl.yJj 






K64 


1 1 8 0 (1 08} 


178 0 


57 9 (4 36} 


3 1 .9 (2.26) 






A65 


\T> 5 C9 23} 


179 9 


55.5 (4.19) 


17.7 (L76) 


Cy 25.8 (1.43);C5 29.2 (1.68);C£ 42.1 (3.00) 




K66 


1 16 9 (8 40) 


178.4 


60.2 (3.99) 


32.4 (1.93) 




K67 


119.5 (7.52) 


179.6 


59.9 (4.14) 


32.7 (2.09) 


Cy25.2(1.66,l : 52) 




168 


120.9 (8.24) 


178.4 


65.9 (3.87) 


38.3 (2.08) 


Cym 18.3 (0.93); C5 - 0.30 




Y69 


120.8(9.15) 


178.6 


62.4 (3.99) 


39.5 (3.52,3.26) 


C5 133.4 (7.07);C£ 1 17.9 (6r78) 




N70 


117.5 (8.77) 


176.0 


55.7 (4.42) 


38.4 (2.99,2.78) 


Ny 1 12.0 (7.59,6.97) 




E71 


118.4 (7.88) 


176.4 


58.8 (4.00) 


30.6(1.76,1.37) 


Cy 35.9 (1.80,1.09) 
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F72 


110.4 (7.76) 




58.5 (4.95) 


43.4 (2.77) 


C5 13JT(7.75);C£ 133.1 (7.19);C? - (6.89) 




173 


116.5 (7.59) 


175.3 


61.3 (4.05) 


38.0(1.59) 


Cy27.2 (I.43,1.03);Cym 18.5 (0.79); 
CS 12.0 (0.76) 




S74 


113.4 (7.43) 


174.6 


57.7 (3.89) 


64.1 (3.44) 






V75 


120.6 (8.20) 


177.1 


64.4 (3.99) 


31.3 (2.18) 


Cy 20.3 (1.02);2 1.1 (1.02) 




Q76 


118.6(8.10) 


175.4 


55.7 (4.29) 


28.2 (2.23,1.87) 


Cy 34.6 (2.30) 




A77 


124.1 (7.54) 


178.7 


52.8 (4.11) 


18.8 (1.10) 






T78 


114.8 (7.99) 


175.6 


64.0 (4.05) 


68.8 (4.11) 


Cy 22.6 (1.27) 




K79 


123.8 (7.89) 


173.4 


54.6 (4.47) 


33.1 (1.76,1.39) 


Cy25.2(1.27);CS 29.7 (1.64,1.51); 
Ce 42.0 (2.84) 




E80 


117.8 (7.16) 


176.2 


56.6 (3.97) 


30.0(1.82) 


Cy34.5 (2.69,2.19) 




V81 


119.3 (8.22) 


176.3 


59.5 (4.56) 


33.4 (2.01) 


Cv22.5 (0.89); 17.5 (0 34) 




N82 


122.5 (9.09) 


173.7 


53.2 (4.64) 


37.5 (2.89,2.63) 






L83 


122.7 (7.43) 


176.1 


53 1 (4 5?) 


46.3 (1.49,1.34) 


Cy26 4 (-)■ C5 24 3 (0 94) - (0.73) 




D84 


120.3 (8.27) 


176.3 


53.2 (4.66) 


41.7 (3.03,2.79) 






S85 


116.6 (8.93) 


176.7 


62.5 (4.00) 


- (4.01) 






C86 


121.4 (8.36) 


177.9 


62.4 (4.30) 


26.1 (3.07) 






T87 


120.2 (8.35) 


178.5 


67.0 (4.06) 


67.0 (4.06) 


Cy 22.7 (1.22) 




R88 


125.5 (8.55) 


178.2 


61.6 (3.73) 


29.9 (1.98,1.91) 


Cy- (1.51,1.67); C5 43.3 (3.10) 




E89 


mi 1 ro ac\\ 


i in c 


en < (a m\ 


n /n m r\Q\ 






E90 


121.2 (8.44) 


178.5 


59.5 (4.01) 


29 A (2.17) 


Cy 35.6 (2.40) 




T91 


115 1(8 


175 8 


67 8 (3 91) 


68 4 (4 42) 


Cy22 3 (1 43> 


q 


S92 


116.6(8.32) 


176.6 


62A (3.90) 


62.7 (4.09,3-99) 




R j 


R93 


122.3 (7.84) 


179.6 


59.3 (4.05) 


29.9 (1.95,1.91) 


Cy27.9 (1.82,1.60);C5 43.5 (3.23) 




N94 


121.6 (8.10) 


176.7 


54.9 (4.33) 


38.6 (3.18) 




O 
Ul 


M95 


115.7 (7.40) 


176.9 


55.1 (4.07) 


31.3 (1.83,1.77) 


Cy 31.7 (2-29,1.94); Cs 15.3 (1.67) " " 




L96 


119.8 (7.29) 


178.4 


57.7 (4.02) 


41.5 (1.80,1.55) 


Cy27.0 (1.80);C523.0 (0.85); 24.8 (0.95) 


E 


E97 


116.6(7.20) 




53.8 (4.52) 


30.1 (1.92,1.85) 


Cy36.1 (2.10) 


O 

t=3' 


P98 


-(-) 


176.9 


64.4 (4.34) 


32.1 (2.26,1.93) 


Cy27.9 (2.08);C549.9 (3.59) 


CO 


T99 


113.7 (8.43) 


179.1 


59.6 (4.79) 


72.5 (4.60) 


Cy21.1 (1.30) 


Hi 


1100 


123.2 (949) 


176.1 


63.2 (4.22) 


39.6(1.97) 


Cyra 19.1 (1.04);Cy 28.5 (1.47,1.39); 


ifi 












C5 14.8 (0.90) 


o 


T101 


109 8 (1 61) 


176.2 


61.5 (4.57) 


69 3 (4 63) 


Cy217(I" 7 0) 


u 


C102 


122.7 (7-48) 


174.5 


62.6 (3.85) 


28.9 (3.15,2.31) 






F103 


1 12 8 £8 78) 


175.2 


57.8 (4.95) 


40.7 (3.71,2.57) 


C8 - (6.76) 




D104 


122.0 (7.72) 


179.1 


58.9 (4.28) 


39.8 (2.82,2.59) 






E105 


119.9 (8.75) 


175.0 


59.4 (4.16) 


28.6 (2.44,2.12) 


Cy 35.9 (2.42) 




A106 


122.9 (8.66) 


178.8 


55.4 (4.03) 


18.6(1.65) 






Q107 


118.7 (9.58) 


178.3 


60.3 (3.90) 


27.9 (-) 


Cy35.2 (-) 




K108 


120.1 (8.11) 


179.3 


59.6 (4.21) 


32.4 (2.18,2.02) 


Cy25.2 (1.46);C529.8 (1.69);Ce 42.2 (3.00) 




K109 


118.8(8.08) 


180.4 


59.0 (4.27) 


31.8 (2.22,2.04) 


Cy 24.9 (1.44);C5 29.1 (1.71, 1.81); 
Ce42.0 (3.0O) 




1110 


121.3 (8.30) 


177.9 


61.5(4.22) 


36.0 (2.44) 


Cy 29.0 (2.06,1.43);Cym 19.6 (1.38); 
C5 10.0 (0.76) 




Fill 


124.5 (9.22) 


176.8 


63.2 (3.80) 


39.2 (3.51,3.21) 


CS 131.9 (6.87);C£ - (7.04) 
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Nl 12 


117.2 (8.51) 


• 


C/C 1 (A CA\ 

50.5 (4.5U; 


T9. A fX AQ 1 OCW 
5o.O \D.\JO r J..yKJ) 






LI 13 


121.5 (7.95) 


180.3 


CO 1 (A O 1 \ 




Cy27.1 (i.83);C5 24.9(1.06) 




M114 


118.6(8.22) 


177.0 


OU.U (4.23) 


1 < A f 1 01 1 A.Q\ 


Cy32.0 (2.38^.26);C£ 15.9 (1.73) 




El 15 


120.2 (8.46) 


177.3 


60.4 (3.49) 


29.6 (1.58) 


Cy 37.3 (2.07) 




K116 


143.5 (7.57) 


176.9 


57.5 (4.26) 


33.1 (1.98) 


Cy24.7 (1.59);C5 29.5 (I.72);Ce42.3 (3.04) 




D117 


116.6 (7.62) 


176.6 


ceo t c rto\ 

55.8 (5.08) 


42.8 (2.92,2.84) 


- 




Si 18 


117.2 (8.45) 


176.2 


03. U 


A1 A fA 11 A m\ 






Y119 


121.8 (8.95) 


175.5 


OU-4 (4.2/) 


3y.2 (2.ol,2.UU/ 


C8-(6.72);Ce 117.2 (6.58) 




R120 


114.4 (7.11) 


179.8 


CO i fl <~>\ 

55.3 (3.52) 


H ft Ci£. 1 QA\ 

2o. / (i.yo f i-oO) 


Cy27.0 (L87,1.64);C5 42.9 (3.16,3.10) 




R121 


117.5 (7.63) 


179.4 


59.6 (3.97) 


*2A "2 /I A"7\ 

30.3 (2.07) 


C5 43.1 (-) 




F122 


124.0 (8.57) 


178.4 


60.0 (3.32) 


jo.3 (2.89,2.76) 


C5 - (6.77) 




U23 


117.7 (7.05) 


176.3 


56.3 (3.27) 


41.4 (1.11,1.06) 


Cy25.6 (1.3I);C525.2 (0.32);I9.6 (-0.09) 




K124 


114.3 (6.72) 


175.5 


54.8 (4.42) 


33.0 (2.09,1.63) 


Cy 24.8 ( 1 .46);C5 28.9 ( 1 .70);Ce 42.2 (2.97) 




S125 


118.2 (7.83) 


175.6 


57.9 (4.79) 


o/.l (4.52) 






R126 


121.1 (9.39) 


176.5 


58.2 (3.92) 


28.4 (1.81) 


Cy25.5 (1.60,1.43); C5 42.5 (3.05,2.97) 




F127 


115.5 (7.47) 


176.0 


60.7 (4.21) 


39.5 (3.42,3.27) 


C5-(7.18) 




Y128 


114.2 (6.68) 


177.2 


^ f\ *y /A A t~W 

59.3 (4.40) 


39.8 (2.39) 


C5 133.0 (7.17);Ce 1 17.6 (6.91) 




L129 


116.8 (8.34) 


180.4 


57.6 (3.71) 


41.7 (1.41,1.37) 


Cy 26.8 (1.76);C5 25.6 (0.74);22.4 (0.89) 




D130 


119.0 (8.61) 


177.8 


56.8 (4.38) 


j9.8 (2.63) 






L131 


117.5 (7.46) 


177.8 


55./ (4.18) 


42.1 (1.33,1.43) 


Cy 22.9 (i.27);C5 22.4 (0.94);22.6 (0.33) 


o 


T132 


108.3 (7.24) 


173.9 


61.6 (4.26) 


70.1 (4.09) 


Cy 2 1.6 (0.73) 




N133 


1 19.8 (7.60) 


173.0 


51.3 (4.95) 


38.8 (2.78,2.62) 






P134 


-(-) 


177.4 


63.7 (4.43) 


32.1 (2.27,1.96) 


Cy 27.2 (L97);C5 50.6 (3.67,3.50) . 


n i 


S135 


115.5 (8.32) 


174.9 


58.8 (4.44) 


63.7 (4.42,3.88) 






S136 


117.6 (8.22) 


174.7 


58.5 (4.47) 


63.8 (3.90,3.13) 




i rj 


C137 


120.7 (8.27) 


175.1 


58.7 (4.55) 


28.1 (2.95) 




irj 


G138 


111.5 (8.43) 


173.8 


45.4 (3.96) 
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Table 1 continued 
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Cy33.9 (2.41) 
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Footnotes to Table S 1 

a In each column, I5 N and 13 C shifts are listed first, and the corresponding *H shifts are given in 
parentheses. J H and 13 C chemical shifts are reported relative to 3^trimethylsily)propionic-d4 acid and 
l5 N shifts relative to external iiguid NH3. 
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Table 2 

Restrained Minimized NMR Coordinates for Free RGS4 
Pages 42-70 



The structural coordinates herein are deposited with Brookhaven Protein Database. 

(Brookhaven National Laboratory) 
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